4 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 




iNiiiiiiiiiiiiiiiiiiiiiiiiiiiiHiiiini 



(43) International Publication Date (10) International Publication Number 

28 November 2002 (28.11.2002) pCT WO 02/095007 A2 



(51) InterDational Patent Classification'': 



(21) international Application Number: PCT/US02/t68l9 



(22) International Filing Date: 23 May 2(X)2 (23.05.2002) 



(25) Filing Language: 

(26) PublicatioD Language: 

(30) Priority Data: 

60/293,267 



CI2N (81) Designated States (nationalji AE, AG, AL, AM, AT, AU, 
AZ, BA. BB. BG, BR. BY, BZ, OA, CH. CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC. EE. ES, H, GB, GD, GE, GH, 
GM, I [R, FRJ, ID. IL, IN, IS. JP, KE. KG, KP, KR. K2, LC. 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN. MW, 
MX. MZ, NO, NZ, OM. PH, PL, PT. RO, RU. SD, SE. SG, 
SI, SK, SL, TJ, TM, TN, TR. IT, TZ. UA. UG, US, UZ. 
VN, YU, ZA, ZM, ZW. 



English 



English 



23 May 2001 (23.05.2001) US 



(63) Related by continuation (CON) or continuation>in-part 
(CIP) to earlier application: 

US 60/293,267 (CIP) 

Filed on 23 May 2001 (23.05.2001) 

(7 1 ) App lica nt (/or all designated States except US) : CORVA S 
INTERNATIONAL, INC. [US/US]; 3030 Science Park 
Road. San Diego. CA 92121 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (/br US onfy): MADISON, Edwin, 
L, [US/US]; 11005 Cedarcrest Way, San Diego. CA 
92121 (US). SEMPLE, Joseph, Edward [USAJS]; 9711 
Caminilo Pudregak, San Diego, CA 92131 (US). VLA- 
SUK, George, P. [USAJS]; 7325 Calle Luna. Carlsbad. 
CA 92009 (US). KEMP, Scott, Jeffrey {US/USJ; 7873 
Avenida Navidad. #263. San Diego, CA 92122 (US). 
KOMANDLA, Mallareddy [IN/US]; 8148 Genesse 
Avenue, #30. San Diego. CA 92122 (US). SIEV, Daniel, 
Vanna [US/US]; 10415 Westchester Avenue. San Diego. 
CA 92126 (US). 

(74) Agents: SEIDMAN, Stephanie, L. et al.; Heller Ehrnian 
White & McAuhfife LLP. 4350 La Jolla Village Drive, San 
E>iego, CA 92122-1246 (US). 



(84) Designated States (regional): ARIPO patent (GH, GM. 
KE. LS, MW, MZ. SD. SL, SZ, TZ, UG. ZM. ZW). 
Eurasian patent (AM, AZ, BY, KG, ICZ, MD, RU. TJ. TM), 
European patent (AT. BE. Cll, CY. DE. DK, ES, Fl, FR. 
GB, GR, IE, IT, LU. MC, NL, PT, SE, TR), OAPI patent 
(BF, BJ, CF, CG, CI, CM. GA, GN. GQ. GW. ML. MR, 
NE, SN, TD. TG). 

Declaration under Rule 4.17: 

— as to the applicant 's entitlement to claim the priority of the 
earlier explication (Rule 4. 1 7 (Hi)) for the following desig- 
nations AE. AG. AL. AM. AZ AU, AZ, BA, BB, BG. BR, BY, 
BZ. CA, CH, CN. CO. CR, CU. CZ, DE, DK, DM. DZ. EC, 
EE, ES, Fl, GB, GO. GE. GH. GM, HR, HU. ID. IL, IN, 
IS. JP. KE, KG. KP. KR, KZ, LC. LK, LR, LS. LT, LU, LV, 
MA. MD. MG, MK, MN, MW. MX. MZ NO. NZ, OM. Pll, 
PL, PT. RO, RU. SD, SE, SG. SI. SK. SL TJ, TM. TN. TR, 
TT, TZ, UA. UG. UZ, VN. YU, ZA, ZM. ZW, ARIPO patent 
(GH, GM, KE. LS, MW, MZ. SD, SL SZ, TZ, UG. ZM, ZW). 
Eurasian patent (AM, AZ, BY. KG, KZ. MD. RU, TJ, TM). 
European patent (AT. BE CH, CY. DE, DK. ES, FL FR, GB, 
GR. IE. IT. LU, MC, NL, PT. SE TR). OAPI patent (BF, BJ, 
CE CG, CI, CM. GA, GN. GQ. GW. ML. MR. NE SN, TD. 
TG) 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations " appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



< 



ON 



(54) Title: CONJUGATES ACTIVATED BY CELL SURFACE PROTEASES AND THERAPEUTIC USES THEREOF 

(57) Abstract: Conjugates, compositions and method for treatment, prevention, or amelioration of one or more symptoms of cell 
surface protease-relaled diseases, including MTSP-related. urokinase-type plasminogen activator (uPA) or endotheliase-relaled dis- 
eases, are provided. The conjugates for use in the compositions and methods are peptidic conjugates that contain therapeutic, in- 
cluding cytotoxic, agents. 



wo 02/095007 PCTAJS02/16819 



-1- 

CONJUGATES ACTIVATED BY CELL SURFACE PROTEASES AND 

THERAPEUTIC USES THEREOF 

RELATED APPLICATIONS 

Benefit of priority to U.S. provisional application Serial No. 60/293,267, 
filed May 23, 2001, to Edwin L. Madison, Joseph Edward Semple and George 
P. Vlasuk, entitled "CONJUGATES ACTIVATED BY CELL SURFACE PROTEASES 
5 AND THERAPEUTIC USES THEREOF" is claimed. Where permitted, the subject 
matter of the application is incorporated by reference in its entirety. 
FIELD OF THE INVENTION 

Conjugates, compositions and methods for localized delivery of 
therapeutic agents for treating a variety of disorders, such as , proliferative 
10 diseases, autoimmune diseases, infectious diseases and inflammatory diseases, 
are provided. The conjugates, which act as prodrugs, contain therapeutic 
agents and peptidic substrates that are cleaved by cell surface proteases to 
release therapeutic agents in the vicinity of the targeted cells. 
BACKGROUND 

15 Effective treatment of cancer and other proliferative diseases involves 

administration of chemotherapeutic agents, typically systemic administration. 
Typically chemotherapeutic agents are cytotoxic agents that act by inhibiting 
proliferation or other metabolic processes, so that actively proliferating and 
growing cells will be targeted by the agent. Such targeting, however, Is not 

20 highly specific, and the side-effects are often devastating. 

Thus, a goal in pharmacology is the design of specific agents that act 
with high specific activity on targeted cells or tissues. This aim is of particular 
importance, for example, in the design of agents for treatments of diseases, 
such as proliferative diseases, including neoplastic disease, and diseases of viral 

,25 origin, in which the ratio of toxic dose to therapeutic dose is generally close to 
one and the dosage must be restricted. Numerous approaches to achieving this 
goal have been developed. Among these are the use of conjugates that contain 
a targeting agent, such as an antibody and/or growth factor, and a therapeutic 
agent, that act on specific cells; the use of antisense technology that is targeted 

30 to specific genes and/or proteins; the use of genetic therapy to provide, for 
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example, correct copies of defective genes or pharmaceutically active 
compounds, and the use of toxins that are relatively non-toxic unless delivered 
intracellularly. Thus far success has been limited. There are only a limited 
number and type of potential targeting agents, and the specificity of such agents 
5 is optimal. 

Hence there is a need to develop means for delivery of therapeutic agents 
to targeted cells and tissues. Therefore, it is an object herein, among others, to 
provide methods and compounds for targeted delivery of therapeutic agents. 
SUWIMARY OF THE INVENTION 

10 Provided herein are compounds and methods for targeted delivery of 

therapeutic agents. The compounds are conjugates that contain a peptidic 
substrate for a cell surface protease, or a soluble, shed or released form thereof, 
and an agent that upon cleavage by the protease is a therapeutic agent or in a 
form that can be activated by the targeted cell or tissue or in the tocalter 

15 thereof. The agents include therapeutic agents, such as a cytotoxic agents, 
drugs, therapeutic nucleic acid moleulces, and diagnostic agents, such as 
labelled moieties and imaging agents. The cell surface proteases are proteases 
located at a cell surface and, include, but are not limited to, membrane-bound 
proteases such as membrane-bound serine proteases (SPs), including, for 

20 example, proteases designated MTSPs and endotheliases. Also contemplated 
are proteases that are located at the cell surface by virtue of a specific binding 
interaction with a receptor therefor. Included among such proteases is 
urokinase plasminogen activator (u-PA; see, e.g.. Hung (1984) Adv. Exp. Med. 
Biol. 172:281-293; Cheng et al. (1989) Gene 55:357-363) bound to urokinase 

25 plasminogen activator receptor (u-PAR). The conjugates contain one or more 
substrates for one or a plurality of cell surface proteases linked either directly or 
via 3 linker to a targeted agent, including a therapeutic agent, such as a 
cytotoxic agent. The conjugates provided herein contain the following 
components: (peptidic substrate)^, (linker)^, and (targeted agent)t in which: at 

30 least one peptidic substrate moiety is linked with or without a linker (L) to at 
least one therapeutic agent, s is 1 or more and each substrate is the same or 
different, and is typically is between 1 and 6, generally 1 , 2 or 3; q is 0 or more 



wo 02/095007 



PCT/US02/16819 



as long as cell surface protease(s} cleaves the peptidic substrate(s} and releases 
active therapeutic agent or, releases the agent in a form that is converted by the 
cell, tissue or surrounding environment to an active form, q is 0 to t, generally 1 
to 4; t is 1 or more, generally 1 or 2 and each targeted agent are the same or 
5 different; linker refers to any linker; and the targeted agent is any agent, 
typically a therapeutic agent, such as a cytotoxic agent, a nucleic acid, a 
diagnostic agent, such as an imaging agent or labeled moiety, or a drug, 
including, but not limited to, anti-tumor, anti-cancer, anti-angiogenic, pro- 
apoptotic and anti-mltotic agents or treatments. 

10 The therapeutic agents include any biologically active molecule. These 

agents include toxins, cytokines and lymphokines, growth factors, nucleic acid 
molecules, such as antisense nucleic acid, dsRNA, and DNA molecules. The 
therapeutic agents include those that are active intracetlularly, such as 
cytotoxins, or extracellularly, such as modulators of the activity of extracellular 

15 receptors. When in the conjugates the therapeutic agents are substantially 

inactive, and when cleaved are released in active form or in a form that can be 
activated by the targeted cell or tissue or environment thereof. 

In an exemplary embodiment, the conjugates for use in the methods and 
compositions provided herein can be represented by the formula: 

20 (peptide')s-(Iinker)q-(therapeutic agent)^ 

or a derivative thereof, where peptide' is a peptidic substrate for a cell surface 
protease; s is greater than or equal to 1 , or is 1 to 6, or is 1 or 2, or is 1 ; linker 
Is any linker; q is greater than or equal to 0, or is 0 to 4, or is 0 or 1 ; the 
therapeutic agent is, for example, a cytotoxic agent, including, but not limited 

25 to, an anti-tumor, anti-angiogenic, anti-cancer, pro-apoptotic and anti-mitotic 
agents; and t is 1 or more, or is 1 or 2. In these conjugates, the therapeutic 
agent is covalently attached, optionally via a linker L, to either the C-terminus or 
the N-terminus of the peptidic substrate. 

In certain embodiments, peptide' is a substrate for a cell surface protease 

30 whereby, upon action of the protease, the conjugate, which is substantially 
inactive, is cleaved at a point on the peptidic substrate chain to release a 
compound of the formula: 
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{peptide*)5-(Iinker)p-{therapeutic agentit 
or a derivative thereof, tliat exhibits therapeutic activity in vitro and/or in vivo. 
in these conjugates, the therapeutic agent is, for example, a cytotoxic agent, 
and peptide" is a truncated version of peptide' resulting from cleavage at the PI- 
5 .PI' bond. 

The conjugates can be used to target and deliver the targeted agents to 
specific cells, and hence can be used for the treatment any diseases that are 
associated with cells or tissues that express a cell surface protease, including 
cell-associated and cell-localized proteases. The cells on which or near which 

10 such proteases are expressed are not necessarily involved in the disease or 
disease process, but are present and can serve to present the protease, which 
cleaves the targeted conjugate. 

Methods of treatment of diseases associated with cells or tissues that 
express a cell surface protease, including cell-associated and cell-localized 

15 proteases. The diseases include, but not limited to, proliferative diseases, 
autoimmune diseases, infectious diseases and inflammatory diseases. For 
example, diseases Include e, but are not limited to, rheumatoid arthritis, lupus, 
multiple sclerosis, psoriasis, diabetic retinopathies, other ocular disorders, 
including recurrence of pterygii, scarring excimer laser surgery and glaucoma 

20 filtering surgery, various disorders of the anterior eye, cardiovascular disorders, 
restenosis, chronic inflammatory diseases, wounds, circulatory disorders, crest 
syndromes, bacterial infections, viral diseases, includuing AIDS, dermatological 
disorders, and cancer, including solid neoplasms and vascular tumors, including, 
but are not limited to, lung, colon, esophageal, breast, ovarian and prostate 

25 cancers. 

Also provided are methods for identifying proteases to target conjugates 
for treatment of diseases. The methods involve identifying cell-surface 
protease-associated disease by identifying a cell involved in the disease process 
or a cell in the vicinity of the cell involved in the disease process; and 
30 identifying a cell surface protease on the cell. Conjugates that target such 
proteases as provided herein can then be prepared. 
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DESCRIPTION OF THE FIGURES 

Figures 1-5 provide in vitro CT50 (time for 50% cleavage) (min) for 
exemplary conjugates provided herein: A = 0.1-25 min; B = 25-100 min; C = 
100-250 min; D = >250 min. 
5 Figure 1 shows exemplary doxorubicin conjugates provided herein and in 

vitro CT50 (min) data for cleavage of the conjugates by MTSPl . 

Figure 2 shows exemplary doxorubicin conjugates provided herein and in 
vitro CT50 (min) data for cleavage of the conjugates by u-PA- 

Figure 3 shows exemplary taxol conjugates provided herein and in vitro 
10 CT50 (min) data for cleavage of the conjugates by MTSPl . 

Figure 4 shows exemplary taxol conjugates provided herein and in vitro 
CT50 (min) data for cleavage of the conjugates by u-PA. 

Figure 5 shows exemplary doxorubicin and taxol conjugates provided 
herein and in vitro CTgo (min) data for cleavage of the conjugates by ET1 
1 5 (endotheliase 1 ) . 

DETAILED DESCRIPTION OF EMBODIMENTS 
A. Definitions 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as is commonly understood by one of skill in the art to 
20 which the invention(s) belong. All patents, patent applications, published 

applications and publications, Genbank sequences, websites and other published 
materials referred to throughout the entire disclosure herein, unless noted 
otherwise, are incorporated by reference in their entirety. In the event that there 
are a plurality of definitions for terms herein, those in this section prevail. 
25 Where reference is made to a URL or other such indentifier or address, It 

understood that such identifiers can change and particular information on the 
internet can come and go, but equivalent information can be found by searching 
the internet. Reference thereto evidences the availability and public 
dissemination of such information. 
30 As used herein, a targeted agent is any agent intended for targeted 

delivery and includes therapeutic agents and diagnostic agents and any other 
agent intended for targeted delivery. 
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As used herein, targeted delivery means delivery to a selected cell or 
tissue that expresses a protease that releases the targeted agent. Such delivery 
does not have to be exclusively to such selected cell or tissue, but must include 
It, and generally deliveries higher amounts to such selected cells or tissues. 
5 Delivery Includes introduction into a cell or tissue or binding to the cell or tissue 
or release in the vicinity of the cell or tissue. For example, in some instances, a 
tumor induces production of proteases, receptors, co-factors or substrates 
asssociated with the stroma; delivery, thus, includes targeting such induced 
stromal activities, such as proteases, receptors and/or enzyme co-factors, in 
10 invading cells or cells in the tumor that is targeted. 

As used herein, therapeutic index is the ratio of LDgo/EDgQ. 
As used herein, a therapeutic agent is any drug or other agent that is 
intended for delivery to a targeted cell or tissue, such as proliferating cells, 
including tumor cells and cells Involved in a proliferative, typically an 
15 undesirable, response. Therapeutic agents, include, but are not limited to, anti- 
cancer agents, anti-angiogenic agents, pro-apoptotic agents, anti-mitotic growth 
factors, cytokines, such as tumor necrosis factors and interleukins, and 
cytotoxic agents and other such agents as described herein and known to those 
of skill in the art. Therapeutic agents include those that are active upon 
20 internalization and also those that act extracellularly, such modulators of the 
activities of certain cell surface receptors, such as G proteins that transduce 
extracellular signals. 

As used herein, an inactive therapeutic agent is a therapeutic agent that 
is conjugated to a peptide and thereby, either by virtue of conformational 
25 changes or size or other factors such as steric hinderance does not exhibit any 
or exhibits substantially reduced activity compared to the released active 
therapeutic agent. For example, conjugated doxorubicin is not toxic to cells until 
it is released from the conjugate in a form that can enter the cell. Upon 
cleavage of the agent from the conjugate it Is in active form or in a form that Is 
30 further processed by one or a plurality of steps, including enzymatically or 
chemically. In or on the cell, into an active form. 
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As used herein, an active therapeutic agent is a therapeutic agent that 
has been released from the conjugate by cleavage of the peptidic substrate 
portion of the conjugate. The active therapeutic agent is by virtue of cleavage 
able to exhibit its intended activity, typically by entering the cell. When 
5 conjugated the therapeutic agents have reduced or no activity as therapeutic 
agents, and upon cleavage are released in the vicinity of a cell. 

As used herein, an anti-cancer agent (used interchangeably with "anti- 
tumor or anti-neoplasm agent") refers to any agents used in the treatment of 
cancer. These include any agents, when used alone or in combination with 
10 other compounds, that can alleviate, reduce, ameliorate, prevent, or place or 
maintain in a state of remission of clinical symptoms or diagnostic markers 
associated with neoplasm, tumor or cancer, and can be used in methods, 
combinations and compositions provided herein. Non-limiting examples of anti- 
neoplasm agents include anti-angiogenic agents, alkylating agents, 
15 antimetabolite, certain natural products, platinum coordination complexes, 

anthracenediones, substituted ureas, methyihydrazine derivatives, adrenocortical 
suppressants, certain hormones, antagonists and anti-cancer polysaccharides. 

As used herein, substantially Inactive with reference to the conjugated 
thereapeutic agent means at least 1 %, generally 10, 20, 30, 50, 60, 70, 80 or 
20 90 or 100% inactive compared to the unconjugated therapeutic agent in a 

standard or art- re cognized assays, such as in vitro or in vivo assays, that assess 
the therapeutic activity of the agent. 

As used herein, a targeted cell or tissue refers to the cells or tissues that 
include cell surface proteases that cleave the conjugates. The cells or tissues 
25 can be involved in the disease or can be present at the disease loci or locus by 
virtue of participation in the disease process or merely serendipitousty. 

As used herein, angiogenesis is intended to broadly encompass the 
totality of processes directly or indirectly Involved in the establishment and 
maintenance of new vasculature (neovascularization), including, but not limited 
30 to, neovascularization associated with tumors. 

As used herein, anti-angiogenic treatment or agent refers to any 
therapeutic regimen and compound, that, when used alone or in combination 
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with other treatment or compounds, can alleviate, reduce, ameliorate, prevent, 
or place or maintain in a state of remission, one or more clinical symptoms or 
diagnostic markers associated with undesired and/or uncontrolled angiogenesis. 
Thus, for purposes herein an anti-angiogenic agent refers to an agent that 
5 inhibits the establishment or maintenance of vasculature. Such agents include, 
but are not limited to, anti-tumor agents, and agents for treatments of other 
disorders associated with undesirable angiogenesis, such as diabetic 
retinopathies, hyperproliferative disorders and others. 

As used herein, non-anti-angiogenic anti-tumor agents refer to anti-tumor 
10 agents that do not act primarily by inhibiting angiogenesis. Whether anti-tumor 
agents act primarily by inhibiting angiogenesis can be determined using the 
assays provided herein, or using other assays well known to those of skill in the 
art. 

As used herein, undesired and/or uncontrolled angiogenesis refers to 

15 pathological angiogenesis wherein the influence of angiogenesis stimulators 
outweighs the Influence of angiogenesis inhibitors. As used herein, deficient 
angiogenesis refers to pathological angiogenesis associated with disorders where 
there is a defect in normal angiogenesis resulting in aberrant angiogenesis or an 
absence or substantial reduction in angiogenesis. 

20 As used herein, a cell surface protease is any protease that is located on 

or at a cell surface and/or proteases that are located at the cell surface by virtue 
of a specific binding interaction with a receptor therefor, or that is localized at or 
near or associated with the cell surface. An exemplary protease located at the 
cell surface by virtue of a specific binding interaction with a receptor therefor is 

25 urokinase plasminogen activator (u-PA) bound to urokinase plasminogen 

activator receptor (u-PAR). Hence cell surface proteases contemplated herein 
include cell surface-associated proteases. It also includes all forms thereof that 
can be circulating or inside a cell. To be categorized as a cell surface protease, 
there must be at least one form thereof that is located {i.e. on the surfaces such 

30 as transmembrane protease or bound to receptor therefor) on the surface of a 
cell at some point in its cycle. Cell surface protease include serine proteases. 
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such as, but are not limited to, the transmembrane serine protease (MTSPs) and 
endotheltases and urokinases. 

As used herein, a serine protease (SPI refers to a diverse family of 
proteases in which a serine residue is involved in the hydrolysis of proteins or 
5 peptides. The serine residue can be part of the catalytic triad mechanism, which 
includes a serine, a histidine and an aspartic acid in the catalysis, or be part of 
the hydroxyl/e-amine or hydroxyl/a-amine catalytic dyad mechanism, which 
involves a serine and a lysine in the catalysis. Of particular interest are SPs of 
mammalian, including human, origin. Those of skill in this art recognize that, in 
10 general, single amino acid substitutions in non-essential regions of a polypeptide 
do not substantially alter biological activity {see, e.g., Watson et al. (1987) 
Molecular Biology of the Gene. 4th Edition, The Bejacmin/Cummings Pub. co., 
p.224). 

As used herein shed, soluble and released forms of cell surface proteases 
15 are contemplated. Such forms include, for example, forms found in serum upon 
proteolytic degradation or other removal of the extracellular portion of membrane 
bound protease, and splice variants that do not include a transmembrane 
domain. 

As shown herein, the protease activity of cell surface proteases and 
20 proteases associated with cells can be exploited to provide a means to 

concentrate therapeutic agents, such as cytotoxic agents, near such cells by 
providing conjugates that are activated upon cleavage by such enzymes. Such 
conjugates, upon the action of a cell surface protease or cell-associate protease, 
release the therapeutic agent, such as a cytotoxic agent, or a derivative thereof 
25 that can be converted to a therapeutic agent, locally at the site of action. As 
noted above, the substrates are designed to be substrates of targeted proteases 
that are expressed or are active on the surfaces of cells, such as tumor cells or 
endothelial cells, involved in or present at the site{s) or locus or loci of the 
disease. By virtue of specific expression, localization or activation of such 
30 proteases or the presence of receptors, substrates or enzyme co-factors 

therefor, administration of the conjugates provided herein permits targeting of 
therapeutic agents to such cells. Upon contacting with the proteases, active 
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therapeutic agents are released in the immediate vicinity of the targeted cells. 
For example, specific profiles of some of the MTSPs are as follows. 

As used herein, "transmembrane serine protease (MTSP)" refers to a 
family of transmembrane serine proteases that share common structural features 
5 as described herein (see, also Hooper et al. (2001) J. Biol. Chem.2 7 6:BS7 -360). 
Thus, reference, for example, to "MTSP" encompasses alt proteins encoded by 
the MTSP genes. Including but are not limited to: MTSPI, MTSPS, MTSP4, 
. MTSP6, MTSP7, MTSP9, MTSPI O, MTSPI 2, MTSP20, MTSP22 and MTSP25 
or an equivalent molecule obtained from any other source or that has been 
10 prepared synthetically or that exhibits the same activity. Other MTSPs include, 
but are not limited to, corin, enteropeptidase, human airway trypsin-like protease 
(HAT), TMPRSS2 and TMPRSS4. The MTSPs described herein can be used to 
identify other MTSPs. Methods for isolating nucleic acid encoding other MTSPs, 
including nucleic acid molecules encoding full-length molecules and splice 
15 variants and MTSPs from species, such as cows, sheep, goats, pigs, horses, 
primates, including chimpanzees and gorillas, rodents, dogs, cats and other 
species of interest, such as domesticated animals, farm and zoo animals are 
known to those of skill in the art and are outlined herein. The nucleic acid 
molecules described herein including those set forth in SEQ IDs can be used to 
20 obtain nucleic acid molecules encoding fuli-length MTSP polypeptides from 

human sources or from other species, such as by screening appropriate libraries 
using the nucleic acid molecules or selected primers or probes based thereon. 

Sequences of encoding nucleic acid molecules and the encoded amino 
acid sequences of exemplary MTSPs and/or domains thereof are set forth in SEQ 
25 ID Nos. 1-45, 269-270 and 272-276. The term also encompasses MTSPs with 
amino acid substitutions that do not substantially alter activity of each member 
and also encompasses polyeptides encoded by splice variants thereof. Hence, 
encompassed are MTSPs with amino acid substitutions such that the resulting 
polypeptide retains at least 1%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 
30 80%, 90% of the proteolytic activity of the unaltered polypeptide, and also 

encompasses MTSPs encoded by splice variants thereof and MTSPs encoded by 
allelic variants, such as single nucleotide polymorphisms (SNPs). Suitable 
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substitutions, including, although not necessarily, conservative substitutions of 
amino acids, are known to those of skill in this art and can be made without 
eliminating the biological activity, such as the catalytic activity, of the resulting 
molecule. MTSPs include those of animal, such as mammalian, including 
5 human, origin. 

As used herein, a "protease domain of an MTSP" refers to an 
extracellular protease domain of an MTSP that exhibits proteolytic activity and 
shares homology and structural features with the chymotrypsin/trypsin family 
protease domains. Hence it is at least the minimal portion of the domain that 
10 exhibits proteolytic activity as assessed by standard in vitro assays. 

Contemplated herein are such protease domains and catalytically active portions 
thereof. 

Exemplary MTSP polypeptides, with the protease domains indicated, are 
set forth in SEQ ID Nos. 1-45, 269-270 and 272-276, and including smaller 

15 portions thereof that retain or exhibit protease activity. The protease domains 
vary in size and constitution, including insertions and deletions in surface loops. 
They retain conserved structure, including at least one of the active site triad, 
primary specificity pocket, oxyanlon hole and/or other features of serine 
protease domains of proteases. Thus, for purposes herein, the protease domain 

20 is a portion of a MTSP, as defined herein, and ts homologous to a domain of 
other MTSPs. MTSPs include, MTSPl, MTSP3, MTSP4, MTSP6, MTSP7, 
MTSP9, MTSPl 0, MTSPl 2, MTSP20, MTSP22 and MTSP25 (see SEQ ID Nos. 
1-19, 42-45, 269-270 and 272-276; see, also International PCT application No. 
WO 02/00860 (see SEQ ID Nos. 38 and 97 therein, which provide an MTSPl 2 

25 variant); conn (SEQ ID Nos. 28 and 29), enteropeptidase (SEQ ID Nos. 30 and 
31) human airway trypsin-IIke protease (HAT) (SEQ ID Nos. 32 and 33), hepsin 
(SEQ ID Nos. 34 and 35), TMPRSS2 (SEQ ID Nos. 36 and 37) and TMPRSS4 
(SEQ ID Nos. 38 and 39). As with the larger class of enzymes of the 
chymotrypsin (SI) fold (see, e.g., Internet accessible MEROPS data base), the 

30 MTSPs protease domains share a high degree of amino acid sequence identity. 
The His, Asp and Ser residues necessary for activity are present in conserved 
motifs. In those that are activated by cleavage, the activation site, which 
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resutts in the N-terminus of second chain in the two chain forms has a 
conserved motif and readily can be identified (see, e.g., amino acids 801-806, 
SEQ ID No. 29, amino acids 406-410, SEQ ID No. 31; amino acids 186-190, 
SEQ ID No. 33; amino acids 161-166, SEQ ID No. 35; amino acids 255-259, 
5 SEQ ID No. 37; amino acids 190-1 94, SEQ ID No. 39 and other as known to 
those of skill and the art and/or as described herein). 

For example, with reference to MTSP10 (see SEQ ID Nos. 44 and 45), 
there disulfide bonds as follows: C^eg-Cgo*, O^stC^^^, Ceig-Cega; Cg^-Cg^a (see 
SEQ ID Nos- 44 and 45) (chymotrypsin numbering 42 to 58; 136-201; 168-182 
10 and 191-220). Disulfide bonds form between the Cys residues C573-C29e to link 
the protease domain to another domain so that upon activation cleavage 
(between residues R462 and of SEQ ID No. 45) the resulting polypeptide is a 
two chain molecule. The C573 (SEQ ID NO. 45 is a free Cys in a single chain 
form of the protease domain. As noted the protease also can be provided as a 

15 two chain molecule. Single chain and two chain forms are proteolytically active. 
A two chain form is produced by bonding, typically between the C573 and a Cys 
outside the protease domain, such as CySggg. Upon activation cleavage the 
disulfide bond remains resulting in a two chain polypeptide. The size of chain 
"A" is a function the starting length of the polypeptide prior to activation 

20 cleavage between the R^j and l^gg. Any length polypeptide that includes the 
protease domain (residues 463-692 of SEQ ID No. 45) or catalytically active 
fragments thereof, is contemplated herein. Two chain forms include at least the 
protease domain a polypeptide from Cjgg up to and including C573. 

As used herein, a two-chain form of the protease domain refers to a two- 

25 chain form that is formed from a single chain form of the protease in which the 
Cys pairing between, e.g., a Cys outside the protease domain such as, for 
example CySg73 (SEQ ID No. 45 for MTSP), which links the protease domain to 
the remainder of the polypeptide, the "A" chain. A two chain protease domain 
form refers to any form in which the "remainder of the polypeptide", i.e., "A" 

30 chain, is shortened and includes a Cys from outside the protease domain. 

As used herein, the catalytically active domain of an MTSP refers to the 
protease domain. Reference to the protease domain of an MTSP generally refers 
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to a single chain form of the protein. If the two-chain form or both forms is 
intended, it is so-specified. The zymogen form of each protein is a single chain, 
which is converted to the active two or multi chain form by activation cleavage. 
By active form is meant a form active in vivo or in vitro. 
5 As used herein, activation cleavage refers to the cleavage of the protease 

at the N-terminus of the protease domain (generally between an R and I or V in 
the full-length protein. By virtue of the Cys-Cys pairing between a Cys outside 
the protease domain and a Cys in the protease domain (see, e.g., Cys^y^ SEQ ID 
No. 45, upon cleavage the resulting polypeptide has two chains ("A" chain and 
10 the "B" chain, which is the protease domain of an MTSP). Cleavage can be 
effected by another protease or autocatalytically. The conjugates provided 
herein advantageously contain sites that are recognized by the active cell 
surface protease (or cell-associated protease) and are cleaved thereby to release 
active or an inactive prodrug form of a therapeutic agent. 
15 As used herein an MTSPl , whenever referenced herein, includes at least 

one or all of or any combination of: 

a polypeptide encoded by the sequence of nucleotides set forth in 
SEQ ID No. 1 or 40; 

a polypeptide encoded by a sequence of nucleotides that 
20 hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in SEQ ID No. 1 or 40; 

a polypeptide that comprises the sequence of amino acids set 
forth in SEQ ID No. 2 or 41; 

a polypeptide that comprises a sequence of amino acids having at 
25 least about 40%, 60%, 70%, 75%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 
87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 
99% sequence identity with the sequence of amino acids set forth in SEQ ID 
No. 2 or 41 ; and/or 

a polypeptide encoded by a splice variant of the MTSPl set forth 
30 in SEQ ID No, 1 or 40. 

The MTSPl can be from any animal, particularly a mammal, and includes 
but is not limited to, humans, rodents, fowl, ruminants and other animals. The 
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ful! length zymogen or two chain activated form is contemplated or any domain 
thereof, including the protease domain, which can be a two chain activated 
form, or a single chain form. MTSP1 also is referred to TADG-1 5 and 
matriptase. As described below, the protein originally designated matriptase 
5 appears to be an MTSPl splice variant or processed product. 

As used herein an MTSP3, whenever referenced herein, includes at least 
one or all of or any combination of: 

a polypeptide encoded by the sequence of nucleotides set forth in 

SEQ ID No. 3; 

10 a polypeptide encoded by a sequence of nucleotides that 

hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in SEQ ID No. 3; 

a polypeptide that comprises the sequence of amino acids set 
forth as amino acids 205-437 of SEQ ID No. 4; 

15 a polypeptide that comprises a sequence of amino acids having af 

least about 40%, 60%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 
87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 
99% sequence Identity with the sequence of amino acids set forth in SEQ ID 
No. 4; and/or 

20 a polypeptide encoded by a splice variant of the MTSP3 set forth 

in SEQ ID Nos. 3 and 4. 

The MTSP3 can be from any animal, particularly a mammal, and includes 
but are not limited to, humans, rodents, fowl, ruminants and other animals. The 
full length zymogen or two chain activated form is contemplated or any domain 
25 thereof, including the protease domain, which can be a two chain activated 
form, or a single chain form. 

As used herein an MTSP4, whenever referenced herein, includes at least 
one or ail of or any combination of: 

a polypeptide encoded by the sequence of nucleotides set forth in 
30 any of SEQ ID No. 5, 7 or 9; 
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a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in any of SEQ ID Nos. 5, 7 or 9; 

a polypeptide that comprises the sequence of amino acids set 
5 forth In any of SEQ ID Nos. 6, 8 or 10; 

a polypeptide that comprises a sequence of amino acids having at 
least about 40%, 60%, 70%, 75%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 
87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 
99% sequence identity with the sequence of amino acids set forth in SEQ ID 
10 No. 6, 8 or 10; and/or 

a polypeptide encoded by a splice variant of the MTSP4s set forth 
in SEQ ID Nos. 7-10. 

The MTSP4 can be from any animal, particularly a mammal, and includes 
but are not limited to, humans, rodents, fowl, ruminants and other animals. The 
1 5 full length zymogen or two chain activated form is contemplated or any domain 
thereof, including the protease domain, which can be a two chain activated 
form, or a single chain form. 

As used herein an MTSP6, whenever referenced herein, includes at least 
one or alt of or any combination of: 
20 a polypeptide encoded by the sequence of nucleotides set forth in 

any of SEQ ID No. 11; 

a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in any of SEQ ID Nos. 1 1 ; 
25 a polypeptide that comprises the sequence of amino acids set 

forth in any of SEQ ID No. 1 2; 

a polypeptide that comprises a sequence of amino acids having at 
least about 40%, 60%, 70%, 75%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 
87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 
30 99% sequence identity with the sequence of amino acids set forth in SEQ ID 
No. 1 2; and/or 
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a polypeptide encoded by a splice variant of the IVITSPe set forth 
in SEQ ID No. 12. 

The MTSP6 can be from any animal, particularly a mammal, and includes but are 
not limited to, humans, rodents, fowl, ruminants and other animals. The full 
5 length zymogen or two chain activated form is contemplated or any domain 
thereof, including the protease domain, which can be a two chain activated 
form, or a single chain form. Of particular interest herein is the MTSP6 of SEQ 
ID No. 12. 

As used herein an MTSP7, whenever referenced herein, includes at least 
10 one or all of or any combination of: 

a polypeptide encoded by the sequence of nucleotides set forth in 
SEQ ID No. 13; 

a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the sequence 
15 of nucleotides set forth in SEQ ID No. 13; 

a polypeptide that comprises the sequence of amino acids set 
forth in SEQ ID No. 13; 

a polypeptide that comprises a sequence of amino acids having at 
least about 40%, 60%, 70%, 75%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 
20 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 
99% sequence identity with the sequence of amino acids set forth in SEQ ID 
No. 14; and/or 

a polypeptide encoded by a splice variant of the MTSP7 set forth 
in SEQ ID No. 13. 

25 The MTSP7 can be from any animal, particularly a mammal, and includes 

but are not limited to, humans, rodents, fowl, ruminants and other animals. The 
full length zymogen or two chain activated form is contemplated or any domain 
thereof, including the protease domain, which can be a two chain activated 
form, or a single chain form. 

30 As used herein an MTSP9, whenever referenced herein, includes at least 

one or all of or any combination of: 
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a polypeptide encoded by the sequence of nucleotides set forth in 
SEQ ID No. 1 7 or SEQ ID No, 42; 

a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the sequence 
5 of nucleotides set forth in SEQ ID No. 17 or 42; 

a polypeptide that comprises the sequence of amino acids set 
forth in SEQ ID No. 1 8 or 43; 

a polypeptide that comprises a sequence of amino acids having at 
least about 40%, 60%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 
10 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 
99% sequence identity with the sequence of amino acids set forth in SEQ ID 
No. 1 8 or 270; and/or 

a polypeptide encoded by a splice variant of the MTSP9 set forth 
in SEQ ID No. 1 7. 

15 The MTSP9 can be from any animal, particularly a mammal, and includes 

but are not limited to, humans, rodents, fowl, ruminants and other animals. The 
full length zymogen or two chain activated form is contemplated or any domain 
thereof, including the protease domain, which can be a two chain activated 
form, or a single chain form. 
20 As used herein an MTSP10, whenever referenced herein, includes at least 

one or ail of or any combination of: 

a polypeptide encoded by the sequence of nucleotides set forth in 
SEQ ID No. 44; 

a polypeptide encoded by a sequence of nucleotides that 
25 hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in SEQ ID No. 44; 

a polypeptide that comprises the sequence of amino acids set 
forth in SEQ ID No. 45; 

a polypeptide that comprises a sequence of amino acids having at 
30 least about 40%, 60%, 70%, 75%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 
87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 
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99% sequence identity with the sequence of amino acids set forth in SEQ ID 
No. 45; and/or 

a polypeptide encoded by a splice variant of the MTSP10 set forth 
in SEQ ID No. 44. 

5 The MTSP10 can be from any animal, particularly a mammal, and 

includes but are not limited to, humans, rodents, fowl, ruminants and other 
animals. The full length zymogen or two chain activated form is contemplated 
or any domain thereof, including the protease domain, which can be a two chain 
activated form, or a single chain form. 
10 MTSP10 polypeptides, including, but not limited to splice variants 

thereof, and nucleic acids encoding MTSPs, and domains, derivatives and 
analogs thereof are provided herein. Single chain protease domains that have an 
N-terminus functionally equivalent to that generated by activation of the 
zymogen form of MTSP10 are also provided. The cleavage site for the protease 
15 domain of MTSPIO is between amino acid R and amino acids I (RillGGT) 
(residues 462-467 SEQ ID No. 45). 

As used herein an MTSP1 2, whenever referenced herein, includes at least 
one or all of or any combination of; SEQ ID No. 19 and 20 

a polypeptide encoded by the sequence of nucleotides set forth in 
20 SEQ ID No. 1 9 or by a sequence of nucleotides that includes nucleotides that 
encode the sequence of amino acids set forth in SEQ ID No. 20; 

a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in is set forth as SEQ ID No. 1 9; 
25 a polypeptide that includes the sequence of amino acids set forth 

in SEQ ID No. 20 or a catalytically active portion thereof; 

a polypeptide that includes a sequence of amino acids having at 
least about 40%, 60%, 70%, 75%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 
87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 
30 99% sequence identity with the sequence of amino acids set forth in SEQ ID 
No. 20; and/or 
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a polypeptide encoded by a splice variant of the MTSP1 2 that 
includes the sequence of amino acids set forth in SEQ ID No, 20. 

In particular, the MTSP12 polypeptide, with the protease domains as 
indicated in SEQ ID Nos. 1 9 and 20, is provided. The polypeptide is a single or 
5 multi-chain polypeptide. A protease domain of an MTSPl 2, whenever 
referenced herein, includes at least one or all of or any combination of or a 
catalytically active portion of: 

a polypeptide that includes the sequence of amino acids set forth 
in SEQ ID No. 20 or a catalytically active portion thereof but that does not 
10 include the sequence of amino acids set forth in SEQ ID No. 271 ; 

a polypeptide that includes the sequence of amino acids set forth 
in SEQ ID No. 272 or a catalytically active fragment thereof; 

a polyeptide containing amino acids 237 to 456 of SEQ ID No. 6, 
a polypeptide containing amino aicds 538 to 765 of SEQ ID No. 20, and a 
15 polypeptide containing amino acids 861 to 1087 of SEQ ID No. 20, but that 
, does not include the sequence of amino acids set forth in SEQ ID No. 271 ; 

a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to a sequence 
of nucleotides that encodes any of the polypeptides of a)-c); 
20 a polypeptide encoded by a sequence of nucleotides that 

hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in SEQ ID No. 20 but that does not encode the sequence 
of amino acids set forth in SEQ ID No. 271 ; 

a polypeptide that includes a sequence of amino acids having at 
25 least about 60%, 70%, 75%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 
88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% 
sequence identity with the sequence of amino acids set forth in SEQ ID No. 20; 

a polypeptide that includes a sequence of amino acids having at 
least about 60%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 
30 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% 

sequence identity with the sequence of amino acids of the polypeptides of a)-e); 
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a polypeptide encoded by a splice variant of a sequence of 
nucleotides that encodes an MTSP12 of any of the above. 

Smaller portions thereof that retain protease activity are also provided. 
The MTSP12 can be from any animal, particularly a mammal, and includes but 
5 are not limited to, humans, rodents, fowl, ruminants and other animals. The 
full-length zymogen or two-chain activated form is contemplated or any domain 
thereof, including the protease domain, which can be a two-chain activated 
form, or a single chain form. MTSP1 2 also includes the variant described 
Internationa! PCT application No. WO 02/00860 (see SEQ ID Nos. 38 and 97 
10 therein). 

As used herein an MTSP20, whenever referenced herein, includes at 
least one or all of or any combination of: 

a polypeptide encoded by the sequence of nucleotides set forth in 
SEQ ID No. 273; 

15 a polypeptide encoded by a sequence of nucleotides that 

hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in SEQ ID No. 273; 

a polypeptide that comprises the sequence of amino acids set 
forth in SEQ ID No; 273; 

20 a polypetide that comprises a sequence of amino acids having at 

least about 40%, 60%, 70%, 75%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 
87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%. 97%, 98% or 
99% sequence identity with the sequence of amino acids set forth in SEQ ID 
No. 274; and/or 

25 a polypeptide encoded by a splice variant of the MTSP20 encoded 

by the sequence of nucleotides that includes the sequence set forth in SEQ ID 
No. 273. 

The MTSP20 may be from any animal, particularly a mammal, and 
includes but are not limited to, humans, rodents, fowl, ruminants and other 
30 animals. The full length zymogen or two-chain activated form Is contemplated 
or any domain thereof, including the protease domain, which can be a two-chain 
activated form, or a single chain form. 
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As used herein an MTSP22, whenever referenced herein, includes at 
least one or all of or any combination of: 

a polypeptide encoded by the sequence of nucleotides set forth in 
SEQ ID No. 275; 

5 a polypeptide encoded by a sequence of nucleotides that 

hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in SEQ ID No. 275; 

a polypeptide that comprises the sequence of amino acids set 
forth in SEQ ID No. 276; 
10 a polypetide that comprises a sequence of amino acids having at 

least about 40%, 60%, 70%, 75%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 
87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 
99% sequence identity with the sequence of amino acids set forth in SEQ ID 
No. 276; and/or 

15 a polypeptide encoded by a splice variant of the MTSP22 set forth 

in SEQ ID No. 275. 

The MTSP22 may be from any animal, particularly a mammal, and 
includes but are not limited to, humans, rodents, fowl, ruminants and other 
animals. The full length zymogen or two-chain activated form Is contemplated 
20 or any domain thereof, including the protease domain, which can be a two-chain 
activated form, or a single chain form. 

As used herein an MTSP25, whenever referenced herein, includes at 
least one or all of or any combination of: 

a polypeptide encoded by the sequence of nucleotides set forth in 
25 SEQ ID No. 269; 

a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in SEQ ID No. 269; 

a polypeptide that comprises the sequence of amino acids set 
30 forth in SEQ ID No. 270; 

a polypetide that comprises a sequence of amino acids having at 
least about 40%, 60%, 70%, 75%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 
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87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 
99% sequence identity with the sequence of amino acids set forth in SEQ ID 
No. 270; and/or 

a polypeptide encoded by a splice variant of the IVITSP25 set forth 
5 in SEQ ID No. 269. 

The MTSP25 may be from any animal, particularly a mammal, and 
includes but are not limited to, humans, rodents, fowl, ruminants and other 
animals. The full length zymogen or two-chain activated form is contemplated 
or any domain thereof, including the protease domain, which can be a two-chain 
10 activated form, or a single chain form. 

As used herein, a human protein is one encoded by nucleic acid present 
In the genome of a human, including all allelic variants and conservative 
variations as long as they are not variants found in other mammals. 

As used herein, not substantially cleaved by plasmin or prostate specific 
15 antigen (PSA) (or other non-cell surface-associated protease), means in 

comparable in vitro assays (under optimal conditions for each enzyme) in which 
the activity of a targeted cell surface membrane protease or catalytically active 
portion of the activity of the protease domain (or a catalytically active form 
thereof) of prostate specific antigen (PSA) or plasmin for cleavage of the 
20 conjugate is compared, the relative activity is greater than at least 2:1 , 3:1 , 4:1 , 
5:1, 10:1, 50:1 or 100:1. 

As used herein, activity refers to the ratio V^JK^^, where kgat is the rate of 
catalytic turnover for a particular enzyme, and is the Michaelis constant for 
the binding of the substrate. 
25 As used herein, a "nucleic acid encoding a protease domain or 

catalytically active portion of a MTSP" shall be construed as referring to a 
nucleic acid encoding only the recited single chain protease domain or active 
portion thereof, and not the other contiguous portions of the MTSP as a 
continuous sequence. 
30 As used herein, a CUB domain is a motif that mediates protein-protein 

interactions in complement components 
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C1r/C1s and has also been identified in various proteins involved in 
developmental processes. 

As used herein, a zymogen is an enzymatically inactive protein (i.e, 
typically, but not necessarily, less than 1 % of active form) that is converted to a 
5 proteolytic enzyme by the action of an activator, including by autoactivation. 
Inactive means less active than the form those of skill in the art consider to be 
the active form of the enzyme. The ratio of activity of a zymogen to the 
activated form varies from enzyme-to-enzyme. 

As used herein, "disease or disorder" refers to a pathological condition in 

10 an organism resulting from, e.g., infection or genetic defect, and characterized 
by identifiable symptoms. The diseases contemplated for treatment herein are 
any for which a cell surface protease, including a cell-localized or cell-associated 
protease is asssociated with a targeted cell or tissue involved in the disease or 
disease process. Such association can be because the protease is involved in 

15 the disease or is serendipttously associated with cells involved with the disease. 
These diseases herein are called cell surface protease-associated diseases. 
Hence, to treat th disease a cellsurface protease is identified that is expressed 
on cells associated with the disorder, such as, for example, immune cells for 
treating inflammatory diseases, and viralty infected cells for treating viral 

20 diseases. The conjugate is designed as described herein for cleavage by the 
selected protease. 

As used herein, neoplasm (neoplasia) refers to abnormal new growth, and 
thus means the same as tumor, which can be benign or malignant. Unlike 
hyperplasia, neoplastic proliferation persists even in the absence of the original 
25 stimulus. 

As used herein, neoplastic disease refers to any disorder involving cancer, 
including tumor development, growth, metastasis and progression. 

As used herein, cancer refers to a general term for diseases caused by 
any type of malignant tumor. 
30 As used herein, malignant, as applied to tumors, refers to primary tumors 

that have the capacity of metastasis with loss of growth control and positional 
control. 
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As used herein, endotheliase refers to a mammalian protein, including 
human protein, that has a transmembrane domain and is expressed or active on 
the surface of endothelial cells and includes a protease domain, particularly an 
extracellular protease domain, and is generally a serine protease {see, also U.S. 
5 application Serial No. 09/717,473 and International PCT application No. WO 
01/36604). Thus, reference, for example, to endotheliase encompasses all 
proteins encoded by the endotheliase gene family, or an equivalent molecule 
obtained from any other source or that has been prepared synthetically or that 
exhibits the same activity. The endotheliase gene family are transmembrane 

10 proteases expressed or active in endothelial cells. These proteases include 
serine proteases. These include proteins that have these features and also 
include a protease domain that exhibits sequence homology to the endotheliases 
1 and 2. Endotheliase 1 and 2, for example exhibit about 40% or 45% identity. 
Sequence homology means sequence identity along its length when aligned to 

15 maximize identity of at least about 25%, 40%, 60%, 70%, 75%, 80%, 81%, 
82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98% or 99% or greater number of residues. Sequence 
homology also is assessed by determining whether the encoding sequences of 
nucleic acids hybridize under conditions of at least moderate, or for more closely 

20 related proteins, high stringency to the nucleic acid molecules provided herein or 
to those that encode the same proteins but differ in sequence by virtue of the 
degeneracy of the genetic code. In addition, "endotheliases" encompasses 
endotheliases with amino acid substitutions, including those set forth in Table 1, 
such that the resulting polypeptide retains at least 1%, 5%, 10%, 20%, 30%, 

25 40%, 50%, 60%, 70%, 80%, 90% of the proteolytic activity of the unaltered 
polypeptide. Suitable substitutions of amino acids are known to those of skill in 
this art and can be made generally without altering the biological activity of the 
resulting molecule. As noted, those of skill in this art recognize that, in general, 
single amino acid substitutions in non-essential regions of a polypeptide do not 

30 substantially alter biological activity (see, e.g., Watson et al. Molecular Biology 
of the Gene, 4th Edition, 1987, The Bejacmin/Cummings Pub. Co., p.224). Also 
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included within the definition of "endotheltases", is the catalytically active 
fragment or shed forms of an endotheliase. 

As used herein an endotheliase 1 , whenever referenced herein, includes 
at least one or all of or any combination of: 
5 a polypeptide encoded by the sequence of nucleotides set forth in 

SEQIDNo. 21; 

a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in SEQ ID No. 21 ; 
10 a polypeptide that comprises the sequence of amino acids set 

forth in SEQ ID No. 22; 

a polypeptide that comprises a sequence of amino acids having at 
least about 40%, 60%, 70%, 75%, 80%, 81 %. 82%, 83%, 84%, 85%, 86%, 
87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 
15 99% sequence identity with the sequence of amino acids set forth in SEQ ID 
No. 22; and/or 

a polypeptide encoded by a splice variant of a nucleic acid 
molecule that encodes a protein containing the polypeptide set forth in SEQ ID 
No. 22. 

20 The endotheliase 1 can be from any animal, particularly a mammal, and 

includes but are not limited to, humans, rodents, fowl, ruminants and other 
animals. The full length zymogen or two chain activated form is contemplated 
or any domain thereof, including the protease domain, which can be a two chain 
activated form, or a single chain form. 

25 As used herein an endotheliase 2, whenever referenced herein, includes 

at least one or all of or any combination of: 

a polypeptide encoded by the sequence of nucleotides set forth in 
SEQ ID No. 23 or 25; 

a polypeptide encoded by a sequence of nucleotides that 

30 hybridizes under conditions of low, moderate or high stringency .to the sequence 
of nucleotides set forth in SEQ ID No. 23 or 25; 
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a polypeptide that comprises the sequence of amino acids set 
forth in SEQ ID No. 24 or 26; 

a polypeptide that comprises a sequence of amino acids having at 
least about 40%, 60%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 
5 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 
99% sequence identity with the sequence of amino acids set forth in SEQ ID 
No. 24 or 26; and/or 

a polypeptide encoded by a splice variant of a nucleic acid set 
forth in SEQ ID No. 23 or 25. 
10 The endotheliase 2 can be from any animal, particularly a mammal, and 

includes but are not limited to, humans, rodents, fowl, ruminants and other 
animals. The full length zymogen or two chain activated form is contemplated 
or any domain thereof, including the protease domain, which can be a two chain 
activated form, or a single chain form. 
15 As used herein, the protease domain of an endotheliase refers to the 

polypeptide portion of the endotheliase that is the extracellular portion that 
exhibits protease activity. The protease domain is a polypeptide that includes at 
least the minimum number of amino acids, generally more than 50 or 1 00, 
required for protease activity. Protease activity can be assessed empirically, 
20 such as by testing the polypeptide for its ability to act as a protease. Assays, 
such as those described in the EXAMPLES, with the exception that a known 
endotheliase substrate is employed in place of the test compounds, can be used 
to assess protease activity. Furthermore, since proteases, particularly serine 
proteases, have characteristic structures and sequences or motifs, the protease 
25 domain can be readily identified by such structure and sequence or motif. 

As used herein, a portion of protease domain of endotheliase refers to a 
portion of protease domain of endotheliase that is located within or is the 
extracellular domain of an endotheliase and exhibits serine proteolytic activity. 
Hence, it is at least the minimal portion of the extracellular domain that exhibits 
30 proteolytic activity as assessed by standard assays. An exemplary protease 

domain of an endotheliase is set forth in SEQ ID No. 22 and as amino acids 321- 
688 and 321-562 of SEQ ID Nos. 24 and 26, respectively. Smaller portions 
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thereof that retain protease activity are contemplated. The protease domains 
vary in size and constitution, including insertions and deletions in surface loops. 
Such domains exhibit conserved structure, including at least one structural 
feature, such as the active site triad, primary specificity pocket, oxyanion hole 
5 and/or other features of serine protease domains of proteases. Thus, for 

purposes herein, the protease domain is a portion of an endotheliase, as defined 
herein, but is homologous in terms of structural features and retention of 
sequence of similarity or homology the protease domain of chymotrypsin or 
trypsin. 

10 As used herein, homologous means about greater than about 25%, 40%, 

60%, 80%, 90%, 95%, 98% or greater sequence identity. By sequence 
identity, the number of conserved amino acids as determined by standard 
alignment algorithms programs, and used with default gap penalties established 
by each supplier. Also homology can be assessed by conserved nucleic acid 

15 sequence, which includes anything that hybridizes under at least low stringency 
conditions and encodes the domain. Similarly, nucleic acid sequence alignment 
programs are commercially available (DNAStar "MegAlign" program {Madison, 
Wl) and the University of Wisconsin Genetics Computer Group (UWG) "Gap" 
program (Madison, Wl)). Substantially homologous nucleic acid molecules 

20 would hybridize typically at moderate stringency or at high stringency all along 
the length of the nucleic acid of interest. Also contemplated are nucleic acid 
molecules that contain degenerate codons in place of codons in the hybridizing 
nucleic acid molecule. 

As used herein, recitation that a polypeptide consists essentially of the 

25 protease domain means that the only endotheliase portion of the polypeptide is a 
protease domain or a catatytically active portion thereof. The polypeptide can 
optionally Include additional non-endotheliase-derived sequences of amino acids. 

As used herein, domain refers to a portion of a molecule, e.g., proteins 
or nucleic acids, that is structurally and/or functionally distinct from other 

30 portions of the molecule. 
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As used herein, an active form of a protease refers to an enzyme that 
catalyzes hydrolysis of proteins or peptides. Reference to a protease includes 
the active and zymogen or other less active form. 

As used herein, nucleic acids include DNA, RNA and analogs thereof, 
5 including peptide nucleic acids (PNA) and mixtures thereof. Nucleic acids can be 
single or two stranded. When referring to probes or primers, optionally labeled, 
with a detectable label, such as a fluorescent or radioiabel, single-stranded 
molecules are contemplated. Such molecules are typically of a length such that 
their targets are statistically unique or of low copy number (typically less than 5, 
10 generally less than 3) for probing or priming a library. Generally a probe or 

primer contains at least 14, 1 6 or 30 contiguous of sequence complementary to 
or identical to a gene of interest. Probes and primers can be 10, 20, 30, 50, 
100 or more nucleic acids long. 

As used herein, nucleic acid encoding a fragment or portion of an 
15 endotheliase refers to a nucleic acid encoding only the recited fragment or 
portion of endotheliase protein, and not the other contiguous portions of the 
endotheliase as a continuous sequence. 

As used herein, heterologous nucleic acid is nucleic acid that, if it is DNA 
encodes RNA, or, if RNA, encodes proteins that generally are not normally 
20 produced in vivo by the cell in which it is expressed or that mediates or encodes 
mediators that alter expression of endogenous nucleic acid, such as DNA, by 
affecting transcription, translation, or other regulatable biochemical processes or 
that is located in a different locus from its normal locus. Heterologous nucleic 
acid is generally not endogenous to the cell into which it is introduced, but has 
25 been obtained from another cell or prepared synthetically. Generally, although 
not necessarily, such nucleic acid encodes RNA and proteins that are not 
normally produced by the cell in which it is now expressed. 

Heterologous nucleic acid, such as DNA, also be referred to as foreign 
nucleic acid, such as DNA. Any nucleic acid, such as DNA, that one of skill in 
30 the art would recognize or consider as heterologous or foreign to the cell in 
which is expressed is herein encompassed by heterologous nucleic acid; 
heterologous nucleic acid includes exogenously added nucleic acid that is also 
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expressed endogenously. Examples of heterologous nucleic acid include, but are 
not limited to, nucleic acid that encodes traceable marker proteins, such as a 
protein that confers drug resistance, nucleic acid that encodes therapeutically 
effective substances, such as anti-cancer agents, enzymes and hormones, and 
5 nucleic acid, such as DNA, that encodes other types of proteins, such as 

antibodies, and RNA, such as RNA interference (RNAi) or other double-stranded 
RNA, and antisense RNA. Antibodies that are encoded by heterologous nucleic 
acid can be secreted or expressed on the surface of the cell in which the 
heterologous nucleic acid has been introduced. 
10 For example, nucleic acid can be the the targeted agent, such as the 

therapeutic or diagnostic agent, in the conjugate. Nucleic acids, include ds RNA 
use for RNA Interference (RNAi) (see, e.g. Chuang et ai (2000) Proc. Nat/. 
Acad. Sci. U.S.A. 57:4985) which is employed to inhibit the expression of a 
targeted gene by generating loss-of-f unction. Methods relating to the use of 
15 RNAi to silence genes in organisms including, mammals, C. elegans, Drosophila 
and plants, and humans are known (see, e.g., Fire et a/. (1998) Nature 357:806- 
811 Fire (1999) Trends Genet. /5:358-363; Sharp (2001) Genes Dev. f 5:486- 
490; Hammond, eta/. (2001) Nature Rev. Genet.2:1 10-1 119; Tuschl (2001) 
C/jem. Biochem. 2:239-245; Hamilton eta/. (1999) Science 255:950-952; 
20 Hammond et al. (2000) Nature 404:293-296; Zamore et al. (2000) Ce// 707:25- 
33; Bernstein eta/. (2001) Nature 409: 363-366; Elbashir eta/. (2001) Genes 
Dev. 75:188-200; Elbashir s/. (2001 ) /Vari/z-e 47 7:494-498; International PCT 
application No. WO 01/29058; International PCT application No. WO 
99/32619). By selecting appropriate sequences, expression of dsRNA can 
25 interfere with accumulation of endogenous mRNA encoding a targeted gene 
product. Regions that include at least about 21 nucleotides and that are 
selective (I.e. whose target is unique) for the nucleic acid encoding a targeted 
gene product are used to prepare the RNAi. 



wo 02/095007 



PCT/US02/16819 



-30- 

As used herein, genetic therapy involves the transfer of heterologous 
nucleic acid, such as DNA, into certain cells, target cells, of a mannnnal, 
particularly a human, with a disorder or conditions for which such therapy is 
sought. The nucleic acid molecules are included in a conjugate linked via a cell 
5 surface protein cleavage site. The nucleic acid, such as DNA, is introduced into 
the selected target cells in a manner such that the heterologous nucleic acid, 
such as DNA, is expressed and a therapeutic product encoded thereby is 
produced. Alternatively the heterologous nucleic acid, such as DNA, can in 
some manner mediate expression of DNA that encodes the therapeutic product, 
10 or it can encode a product, such as a peptide or RNA that in some manner 
mediates, directly or indirectly, expression of a therapeutic product. Genetic 
therapy can also be used to deliver nucleic acid encoding a gene product that 
replaces a defective gene or supplements a gene product produced by the 
mammal or the cell in which it is introduced. The introduced nucleic acid can 

15 encode a therapeutic compound, such as a growth factor inhibitor thereof, or a 
tumor necrosis factor or inhibitor thereof, such as a receptor therefor, that is not 
normally produced in the mammalian host or that is not produced in 
therapeutically effective amounts or at a therapeutically useful time. The 
heterologous nucleic acid, such as DNA, encoding the therapeutic product can 

20 be modified prior to introduction into the cells of the afflicted host in order to 
enhance or otherwise alter the product or expression thereof. Genetic therapy 
can also involve delivery of an inhibitor or repressor or other modulator of gene 
expression, such dsRNA or antisense or other nucleic acid molecule. The 
conjugates herein can be used to deliver a product, such as a nucleic acid for 

25 gene therapy. 

As used herein, a therapeutically effective product for gene therapy is a 
product that is encoded by heterologous nucleic acid, typically DNA, that, upon 
introduction of the nucleic acid into a host, a product is expressed that 
ameliorates or eliminates the symptoms, manifestations of an inherited or 

30 acquired disease or that cures the disease. Also included are biologically active 
nucleic acid molecules, such as RNAi and antisense. 
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As used herein, a sequence complementary to at least a portion of an 
RNA, with reference to antisense oligonucieotides, means a sequence having 
sufficient complementarily to be able to hybridize with the RNA, generally under 
moderate or high stringency conditions, forming a stable duplex; in the case of 
5 double-stranded SP antisense nucleic acids, a single strand of the duplex DNA 
(or dsRNA) can thus be tested, or triplex formation can be assayed. The ability 
to hybridize depends on the degree of complementarily and the length of the 
antisense nucleic acid. Generally, the longer the hybridizing nucleic acid, the 
more base mismatches with a SP encoding RNA it can contain and still form a 

10 stable duplex (or triplex, as the case can be). One skilled in the art can ascertain 
a tolerable degree of mismatch by use of standard procedures to determine the 
melting point of the hybridized complex. 

Amino acid substitutions can be made or occur in any SPs and protease 
domains thereof. Amino acid substitutions include conservative substitutions, 

15 such as those set forth in Table 1 , which do not eliminate proteolytic activity. 
As described herein, substitutions that alter properties of the proteins, such as 
removal of cleavage sites and other such sites are also contemplated; such 
substitutions are generally non-conservative, but can be readily effected by 
those of skill in the art. 

20 Suitable conservative substitutions of amino acids are known to those of 

skill in this art and can be made generally without altering the biological activity, 
for example enzymatic activity, of the resulting molecule. Also included within 
the definition, is the catalytically active fragment of an SP, particularly a single 
chain protease portion. 

25 Conservative amino acid substitutions are made, for example, in accordance 
with those set forth in TABLE 1 as follows: 
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TABLE 1 





Ala (A) 


Gly; Ser 




Arg {R) 


Lys, Orn 


5 


Asn (N) 


Gin; His 




Asp (D) 


Glu 




Cys to 


Ser 




Gin (Q) 


Asn 




Glu (E) 


Asp 


10 


Gly (G) 


Ala; Pro 




His (H) 


Asn; Gin 




lie (i) 


Leu; Val; Nte; Met 




Leu (L) 


lie; Val; NIe; Met 




Lys (K) 


Arg; Gin; Glu 


15 


Met (M) 


Leu; Tyr; lie; NIe 




Phe (F) 


Met; Leu; Tyr, Trp 




Ser (S) 


Thr 




Thr (T) 


Ser 




Trp (W) 


Tyr; Phe 


20 


Tyr (Y) 


Trp; Phe 




Val (V) 


He; Leu; NIe; Met 



Other substitutions are also permissible and can be determined empirically or in 
accord with known conservative substitutions. For example, one or more amino 
acid residues within the sequence can be substituted by another amino acid of a 

25 similar polarity which acts as a functional equivalent, resulting in a silent 

alteration. Substitutes for an amino acid within the sequence can be selected 
from other members of the class to which the amino acid belongs. For example, 
the nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, 
valine, proline, phenylalanine, tryptophan and methionine. The polar neutral 

30 amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, 
and glutamine. The positively charged (basic) amino acids include arginine, 
lysine and histidine. The negatively charged (acidic) amino acids include aspartic 
acid and glutamic acid. 

As used herein, the amino acids, which occur in the various amino acid 

35 sequences appearing herein, are identified according to their well-known, three- 
letter or one-letter abbreviations. The nucleotides, which occur in the various 
DNA fragments, are designated with the standard single-letter designations used 
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routinely in the art. Other abbreviations. Include: hR or hArg for homoarginine; 
hY or hTyr for homotyrosine; Cha for cyclohexylalanine; Amf for 
4-anninomethylphenylaianine; DPL for 2-(4,6-dimethylpyrimidinyl)lysine; 
{rmidazolyDK for N'-(2-imidazolyl)lysine; Me2P03-Y for 
5 0-dimethylphosphotyrosine; O-Me-Y for 0-methyltyrosine; TIC for 

tetrahydro-3-isoquinoline carboxylic acid; MeL for 2-keto-3-amino-5-methyl- 
hexane; DAP for 1 ,3-dianninopropane; TFA for trifluoroacetic acid; AA for acetic 
acid. 

As used herein, a splice variant refers to a variant produced by 
10 differential processing of a primary transcript of genomic DNA that results in 
more than one type of mRNA. 

As used herein, a probe or primer based on a nucleotide sequence 
disclosed herein, includes at least 10, 14, generally at least 1 6 or 30 or 1 00 
contiguous sequence of nucleotides. 
15 As used herein, anttsense polynucleotides refer to synthetic sequences of 

nucleotide bases complementary to mRNA or the sense strand of double- 
stranded DNA. Admixture of sense and antisense polynucleotides under 
appropriate conditions leads to the binding of the two molecules, or 
hybridization. When these polynucleotides bind to {hybridize with) mRNA, 
20 inhibition of protein synthesis (translation) occurs. When these polynucleotides 
bind to double-stranded DNA, inhibition of RNA synthesis {transcription) occurs. 
The resulting inhibition of translation and/or transcription leads to an inhibition of 
the synthesis of the protein encoded by the sense strand. Antisense nucleic 
acid molecules typically contain a sufficient number of nucleotides to specifically 
25 bind to a target nucleic acid, generally at least 5 contiguous nucleotides, often 
at least 14 or 16 or 30 contiguous nucleotides or modified nucleotides 
complementary to the coding portion of a nucleic acid molecule that encodes a 
gene of interest, for example, nucleic acid encoding a single chain protease 
domain of an SP. 

30 As used herein, an array refers to a collection of elements, such as 

antibodies, containing three or more members. An addressable array is one in 
which the members of the array are identifiable, typically by position on a solid 
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phase support- Hence, in general the members of the array are immobilized on 
discrete identifiable loci on the surface of a solid phase. 

As used herein, antibody refers to an immunoglobulin, whether natural or 
partially or wholly synthetically produced, including any derivative thereof that 
5 retains the specific binding ability of the antibody. Hence antibody includes any 
protein having a binding domain that is homologous or substantially homologous 
to an immunoglobulin binding domain. Antibodies include members of any 
immunoglobulin claims, including IgG, IgM, IgA, IgD and IgE. 

As used herein, antibody fragment refers to any derivative of an antibody 
10 that is less than full-length, retaining at least a portion of the full-length 

antibody's specific binding ability. Examples of antibody fragments include, but 
are not limited to. Fab, Fab', F{ab)2, single-chain Fvs (scFV), FV, dsFV diabody 
and Fd fragments. The fragment can include multiple chains linked together, 
such as by disulfide bridges. An antibody fragment generally contains at least 
15 about 50 amino acids and typically at least 200 amino acids. 

As used herein, an Fv antibody fragment is composed of one variable 
heavy domain (VJ and one variable light domain linked by noncovalent 
interactions. 

As used herein, a dsFV refers to an Fv with an engineered intermolecular 
20 disulfide bond, which stabilizes the Vh-V^ pair. 

As used herein, an Flabjj fragment is an antibody fragment that results 
from digestion of an immunoglobulin with pepsin at pH 4.0-4.5; it can be 
recombinantly expressed to produce the equivalent fragment. 

As used herein. Fab fragments are antibody fragments that result from 
25 digestion of an immunoglobulin with papain; they can be recombinantly 
expressed to produce the equivalent fragment. 

As used herein, scFVs refer to antibody fragments that contain a variable 
light chain (Vl) and variable heavy chain (V^) covatently connected by a 
polypeptide linker in any order. The linker is of a length such that the two 
30 variable domains are bridged without substantial interference. Exemplarly linkers 
include, but are not limited to, (Gly-Ser)n residues, which can include ome Glu or 
Lys residues dispersed throughout, for example, to increase solubility. 
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As used herein, humanized antibodies refer to antibodies that are 
modified to include human sequences of amino acids so that administration to a 
human does not provolce an immune response. Methods for preparation of 
such antibodies are known. For example, to produce such antibodies, the 
5 encoding nucleic acid in the hybridoma or other prokaryotic or eukaryotic cell, 
such as an E. coli or a CHO cell, that expresses the monoclonal antibody is 
altered by recombinant nucleic acid techniques to express an antibody in which 
the amino acid composition of the non-variable region is based on human 
antibodies. Computer programs have been designed to identify such non- 
10 variable regions. 

As used herein, diabodies are dimeric scFV; dtabodies typically have 
shorter peptide linkers than scFvs, and they generally dimerize. 

As used herein, production by recombinant means by using recombinant 
DMA methods means the use of the well known methods of molecular biology 
1 5 for expressing proteins encoded by cloned DNA. 

As used herein, the term assessing is intended to include quantitative and 
qualitative determination in the sense of obtaining an absolute value for the 
activity of an SP, or a domain thereof, present in the sample, and also of 
obtaining an index, ratio, percentage, visual or other value indicative of the level 
20 of the activity. Assessment can be direct or indirect and the chemical species 
actually detected need not of course be the proteolysis product itself but can for 
example be a derivative thereof or some further substance. 

As used herein, biological activity refers to the in vivo activities of a 
compound or physiological responses that result upon in vivo administration of a 
25 compound, composition or other mixture. Biological activity, thus, encompasses 
therapeutic effects and pharmaceutical activity of such compounds, 
compositions and mixtures. Biological activities can be observed in in vitro 
systems designed to test or use such activities. 

As used herein, a combination refers to any association between two or 
30 among more items. 
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As used herein, fluid refers to any composition that can flow. Fluids thus 
encompass connpositions that are in the form of semi-solids, pastes, solutions, 
aqueous mixtures, gels, lotions, creams and other such compositions. 

As used herein, an effective amount of a compound for treating a 
5 particular disease is an amount that is sufficient to ameliorate, or in some 

manner reduce the symptoms associated with the disease. Such amount can be 
administered as a single dosage or can be administered according to a regimen, 
whereby it is effective. The amount can cure the disease but, typically, is 
administered in order to ameliorate the symptoms of the disease. Repeated 

10 administration can be required to achieve the desired amelioration of symptoms. 

As used herein, equivalent, when referring to two sequences of nucleic 
acids, means that the two sequences in question encode the same sequence of 
amino acids or equivalent proteins. When equivalent is used in referring to two 
proteins or peptides, it means that the two proteins or peptides have 

15 substantially the same amino acid sequence with amino acid substitutions (see, 
e.g., Table 1 , above) that do not substantially alter the activity or function of the 
protein or peptide [i.e, retain at least about 1 % of the activity). When 
equivalent refers to a property, the property does not need to be present to the 
same extent {e.g., two peptides can exhibit different rates of the same type of 

20 enzymatic activity), but the activities are generally substantially the same. 

Complementary, when referring to two nucleotide sequences, means that the 
two sequences of nucleotides are capable of hybridizing, typically with less than 
25%, often with less than 15%, or even less than 5% or with no mismatches 
between opposed nucleotides. Generally the two molecules hybridize under 

25 conditions of high stringency. 

As used herein, a method for treating or preventing disease or disorder 
associated wrth undestred and/or uncontrolled angiogenesis means that the 
diseases or the symptoms associated with the undesired and/or uncontrolled 
angiogenesis are alleviated, reduced, ameliorated, prevented, placed in a state of 

30 remission, or maintained in a state of remission. It also means that the 

hallmarks of pathological angiogenesis are eliminated, reduced or prevented by 
the treatment. Non-limiting examples of the hallmarks of the pathological 
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angiogenesis include uncontrolled degradation of the basement membrane and 
proximal extracellular matrix of the endothelial cells, migration, division, and 
organization of the endothelial cells Into new functioning capillaries, and the 
persistence of such functioning capillaries. 
5 As used herein, operatively linked or operationally associated refers to the 

functional relationship of DNA with regulatory and effector sequences of 
nucleotides, such as promoters, enhancers, transcriptional and translational stop 
sites, and other signal sequences. For example, operative linkage of DNA to a 
promoter refers to the physical and functional relationship between the DNA and 

10 the promoter such that the transcription of such DNA is initiated from the 
promoter by an RNA polymerase that specifically recognizes, binds to and 
transcribes the DNA. In order to optimize expression and/or in vitro 
transcription, it can be necessary to remove, add or alter 5' untranslated 
portions of the clones to eliminate extra, potential inappropriate alternative 

15 translation initiation {i.e., start) codons or other sequences that can interfere 
with or reduce expression, either at the level of transcription or translation. 
Alternatively, consensus ribosome binding sites (see, e.g., Kozak (1991) J. Biof. 
Chem. 255:19867-19870) can be inserted immediately 5' of the start codon 
and can enhance expression. The desirability of (or need for) such modification 

20 can be empirically determined. 

As used herein, a promoter region or promoter element refers to a 
segment of DNA or RNA that controls transcription of the DNA or RNA to which 
it is operatively linked. The promoter region includes specific sequences that are 
sufficient for RNA polymerase recognition, binding and transcription initiation. 

25 This portion of the promoter region is referred to as the promoter. In addition, 
the promoter region includes sequences that modulate this recognition, binding 
and transcription initiation activity of RNA polymerase. These sequences can be 
CIS acting or can be responsive to trans acting factors. Promoters, depending 
upon the nature of the regulation, can be constitutive or regulated. Exemplary 

30 promoters contemplated for use in prokaryotes include the bacteriophage T7 and 
T3 promoters. 
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As used herein, sample refers to anything which can contain an analyte 
for which an analyte assay is desired. The sample can be a biological sample, 
such as a biological fluid or a biological tissue. Examples of biological fluids 
include urine, blood, plasma, serum, saliva, semen, stool, sputum, cerebral 
5 spinal fluid, tears, mucus, amniotic fluid or the like. Biological tissues are 

aggregates of cells, usually of a particular kind together with their intercellular 
substance that form one of the structural materials of a human, animal, plant, 
bacterial, fungal or viral structure, including connective, epithelium, muscle and 
nerve tissues. Examples of biological tissues also Include organs, tumors, lymph 

10 nodes, arteries and individual celKs). 

As used herein, to hybridize under conditions of a specified stringency is 
used to describe the stability of hybrids formed between two single-stranded 
DNA fragments and refers to the conditions of ionic strength and temperature at 
which such hybrids are washed, following annealing under conditions of 

15 stringency less than or equal to that of the washing step. Typically high, 

medium and low stringency encompass the following conditions or equivalent 
conditions thereto: 

1) high stringency: 0.1 x SSPE or SSC, 0.1% SDS, 65°C 

2) medium stringency: 0.2 x SSPE or SSC, 0.1 % SDS, 50 °C 
20 3) low stringency: 1 .0 x SSPE or SSC, 0.1 % SDS, 50°C. 

Equivalent conditions refer to conditions that select for substantially the same 
percentage of mismatch In the resulting hybrids. Additions of ingredients, such 
as formamide, Ficoll, and Denhardt's solution affect parameters such as the 
temperature under which the hybridization should be conducted and the rate of 

25 the reaction. Thus, hybridization in 5 X SSC, in 20% formamide at 42° C is 
substantially the same as the conditions recited above hybridization under 
conditions of low stringency. The recipes for SSPE, SSC and Denhardt's and 
the preparation of deionized formamide are described, for example, in Sambrook 
eta/. (1989) Molecular Cloning, A Laboratory Manual. Cold Spring Harbor 

30 Laboratory Press, Chapter 8; see, Sambrook etai. vol. 3, p. B.13, see, also, 
numerous catalogs that describe commonly used laboratory solutions). It is 
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understood that equivalent stringencies can be achieved using alternative 
buffers, salts and temperatures. 

The terms substantially identical or similar varies with the context as 
understood by those skilled in the relevant art and generally means at least 40, 
5 60, 80, 90, 95 or 98%. 

As used herein, substantially identical to a product means sufficiently 
similar so that the property of interest is sufficiently unchanged so that the 
substantially identical product can be used in place of the product. 

As used herein, target cell refers to a cell that expresses a cell surface 
10 protease. 

As used herein, test substance, including compounds provided herein, 
refers to a chemically defined compound {e.g., organic molecules, inorganic 
molecules, organic/inorganic molecules, proteins, peptides, nucleic acids, 
oligonucleotides, lipids, polysaccharides, saccharides, or hybrids among these 

15 molecules such as glycoproteins, etc.) or mixtures of compounds {e.g., a library 
of test compounds, natural extracts or culture supernatants, etc.) whose effect 
on or interaction with a cell surface protein or cell surface-associated protein, or 
a domain thereof, is determined by the methods herein. 

As used herein, the terms a therapeutic agent, therapeutic regimen, 

20 radioprotectant, chemotherapeutic mean conventional drugs and drug therapies, 
including vaccines, which are known to those skilled in the art. 
Radiotherapeutic agents are well known in the art. 

As used herein, vector (or plasmid) refers to discrete elements that are 
used to introduce heterologous DNA into cells for expression and/or replication 

25 thereof. The vectors typically remain episomal, but can be designed to effect 
integration of a gene or portion thereof into a chromosome of the genome. Also 
contemplated are vectors that are artificial chromosomes, such as yeast artificial 
chromosomes and mammalian artificial chromosomes. Selection and use of such 
vehicles are well known to those of skill in the art. An expression vector 

30 includes vectors capable of expressing DNA that is operatively linked with 

regulatory sequences, such as promoter regions, that are capable of effecting 
expression of such DNA fragments. Thus, an expression vector refers to a 



wo 02/095007 



PCTAJS02/16819 



-40- 

recombinant DNA or RNA construct, such as a plasmid, a phage, recombinant 
virus or other vector that, upon introduction into an appropriate host cell, results 
in expression of the cloned DNA. Appropriate expression vectors are well 
known to those of skill in the art and include those that are replicable In 
5 eukaryotic cells and/or prokaryotic cells and those that rennain episomat or those 
which integrate into the host cell genome. 

As used herein, chemically stable means that the compound is stable 
enough to be formulated for pharmaceutical use. Such chemical stability is well 
known to those of skill in the art and can be determined by well known routine 
10 methods. Whether a given compound is chemically stable enough to be 

formulated for pharmaceutical use depends on a number of factors including, but 
not limited to, the type of formulation and route of administration desired, the 
disease to be treated, and the method of preparing the pharmaceutical 
formulation. 

15 As used herein, a "functional equivalent" of a side chain of an amino acid 

is a group or moiety that functions in substantially the same way as the 
naturally occurring side chain to achieve substantially the same result {e.g., a 
substrate for a cell surface protease). For example, functional equivalents of the 
side chain of arginine include, but are not limited to, hpmoarginlne, 

20 guanidinoaminopropyl, guanidinoaminoethyl, (Me)2arginine side chain, 

{Et)2arginine side chain, {4-aminomethyl)phenylmethyl, 4-amidinophenylmethyl, 
4-guanidlnophenylmethyl, or a conformationally constrained arginine side chain 
analog such as: 



25 NH 




35 
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where x is 0 or 1 (see, e.g., Webb et al. (1991) J. Org. Chem. 56:3009), or a 
conformationaily constrained arginine side chain analog such as: 




where d Is an integer from 0 to 5, or 1 to 3; and W is N or CH; or a mono- or di- 

20 substituted N-alkyI derivative of the above groups, where alkyi is, in certain 
embodiments, lower alkyI, such as methyl. 

As used herein, pharmaceutically acceptable derivatives of a compound 
include salts, esters, enol ethers, enol esters, acids, bases, solvates, hydrates or 
prodrugs thereof. Such derivatives can be readily prepared by those of skill in 

25 this art using known methods for such derivatization. The compounds produced 
can be administered to animals or humans without substantial toxic effects and 
either are pharmaceutically active or are prodrugs. Pharmaceutically acceptable 
salts include, but are not limited to, amine salts, such as but not limited to N,N'- 
dibenzylethylenediamine, chloroprocaine, choline, ammonia, diethanolamine and 

30 other hydroxyalkylamines, ethylenediamine, N-methylglucamine, procaine, N- 
benzylphenethylamine, 1 -para-chlorobenzyl-2-pyrrolidln-1 '-ylmethyl- 
benzimidazole, diethylamine and other alkytamines, piperazine and tris(hydroxy- 
methyDaminomethane; alkali metal salts, such as but not limited to lithium, 
potassium and sodium; alkali earth metal salts, such as but not limited to 

35 barium, calcium and magnesium; transition metal salts, such as but not limited 
to zinc; and other metal salts, such as but not limited to sodium hydrogen 
phosphate and disodium phosphate; and also including, but not limited to, salts 
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of mineral acids, such as but not limited to hydrochlorides and sulfates; and 
salts of organic acids, such as but not limited to acetates, lactates, malates, 
tartrates, citrates, ascorbates, succinates, butyrates, valerates and fumarates. 
Pharmaceuticaily acceptable esters include, but are not limited to, alkyi, alkenyl, 
5 alkynyl, aryl, heteroaryl, aralkyi, heteroaralkyl, cycloalkyl and heterocyclyl esters 
of acidic groups, including, but not limited to, carboxylic acids, phosphoric 
acids, phosphinic acids, sulfonic acids, sulfinic acids and boronic acids. 
Pharmaceuticaily acceptable enol ethers include, but are not limited to, 
derivatives of formula C = C{OR) where R is hydrogen, alkyI, alkenyl, alkynyl, 

10 aryl, heteroaryl, aralkyi, heteroaralkyl, cycloalkyl or heterocyclyl. 

Pharmaceuticaily acceptable enol esters include, but are not limited to, 
derivatives of formula C = C(OC(0)R) where R is hydrogen, alkyl, alkenyl, 
alkynyl, aryl, heteroaryl, aralkyi, heteroaralkyl, cycloalkyl ar heterocyclyl. 
Pharmaceuticaily acceptable solvates and hydrates are complexes of a 

15 compound with one or more solvent or water molecule, generally 1 to about 
100, typically 1 to about 10, such as 1 to about 2, 3 or 4, solvent or water 
molecules. 

As used herein, treatment means any manner in which one or more of 
the symptoms of a condition, disorder or disease are ameliorated or otherwise 

20 beneficially altered. Treatment also encompasses any pharmaceutical use of the 
compositions herein, such as use for treating cancer. 

As used herein, amelioration of the symptoms of a particular disorder by 
administration of a particular pharmaceutical composition refers to any 
lessening, whether permanent or temporary, lasting or transient that can be 

25 attributed to or associated with administration of the composition. 

As used herein, a prodrug is a compound that, upon in vivo 
administration, is metabolized or otherwise converted to the biologically, 
pharmaceuticaily or therapeutically active form of the compound. To produce a 
prodrug, the pharmaceuticatly active compound is modified such that the active 

30 compound is regenerated by metabolic processes. The prodrug can be designed 
to alter the metabolic stability or the transport characteristics of a drug, to mask 
side effects or toxicity, to improve the flavor of a drug or to alter other 
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characteristics or properties of a drug. By virtue of knowledge of 
pharmacodynamic processes and drug metabolism in vivo, those of skill in this 
art, once a pharmaceutically active compound is known, can design prodrugs of 
the compound (see, e.g., Nogrady (19851 Medicinai Chemistry A Biochemical 
5 Approach, Oxford University Press, New York, pages 388-392). 

It is to be understood that the conjugates provided herein can contain 
chiral centers. Such chiral centers can be of either the (R) or (S) configuration, 
or can be a mixture thereof. Thus, the compounds provided herein can be 
enantiomerically pure, or be stereoisomeric or diastereomeric mixtures. In the 
10 case of amino acid residues, such residues can be of either the L- or D-form. 
The configuration for naturally occurring amino acid residues is generally L. 
When not specified the residue is the L form. It is to be understood that the 
chiral centers of the compounds provided herein can undergo epimerization in 
vivo. As such, one of skill in the art will recognize that administration of a 
15 compound in its (R) form is equivalent, for compounds that undergo 

epimerization in vivo, to administration of the compound in its (S) form. 

The conjugates provided herein are prodrugs because they include a 
therapeutic agent in an inactive form that is ultimately converted to an active 
form at the targeted cell or tissue or In the environment thereof. Upon exposure 
20 to targeted protease either a biologically, pharmaceutically or therapeutically 
active form of a compound is released, or, a derivative that can be further 
nietabolized into a biologically, pharmaceutically or therapeutically active form of 
a compound. 

As used herein, substantially pure means sufficiently homogeneous to 
25 appear free of readily detectable impurities as determined by standard methods 
of analysis, such as thin layer chromatography (TLC), gel electrophoresis, high 
performance liquid chromatography (HPLC) and mass spectrometry (MS), used 
by those of skill in the art to assess such purity, or sufficiently pure such that 
further purification would not alter the physical and chemical properties, such as 
30 enzymatic and biological activities, of the substance for its intended purpose. 
Methods for purification of the compounds to produce substantially chemically 
pure compounds are known to those of skill in the art. A substantially 
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chemically pure compound may, however, be a mixture of stereoisomers. In 
such instances, further purification might increase the specific activity of the 
compound. 

As used herein, a peptidic substrate includes peptides and molecules, 
5 such as peptide mimetics and peptides that include peptide bond surrogates. 
As used herein, conventional terminology (Schecter era/. (1967) 
Biochem. Biophys. Res. Commun. 27:157-162) is used to refer to specific 
subsites of a protease substrate: 

Pn...P3-P2-P1 4 Pl'-P2'-P3'...Pn'. The scissile bond (i.e., the cleavage site) of a 
10 substrate is indicated by the arrow. Positions N-terminal of that bond are 

referred to as unprimed positions. Subsites are then assigned a number based 
on their distance from the scissile bond. Amino acids (or amino acid surrogates) 
that form the scissile bond are assigned the number 1 , adjacent residues the 
number 2, and so on, counting away from the scissile bond. Each specific 
15 subsite of the substrate, therefore, is uniquely identified by a number and the 
designation as primed or unprimed. 

As used herein, a surrogate of a peptide bond is a divalent group that 
possesses similar steric and/or electronic characteristics to -C(0)NH-. Peptide 
bond surrogates include, but are not limited to, alkene isosteres (-CR = CR-), 
20 particularly {E)-alkene isosteres of formula -CH = CH-, hydroxyethylene isosteres 
(-CH{OH)CH2-), enamine isosteres (-C( = CRR)NH-), aminoalcohol isosteres 
(-CHiOHlCHjNH-l, difluoroketone isosteres (-C(0)CF2-), retroinverso compounds 
(-NHC(O)-), divalent heterocyclyl or heteroaryl groups, and cyclopropyl isosteres 
such as: 




As used herein, alkyi, alkenyl and alkynyl carbon chains, if not specified, 
contain from 1 to 20 carbons, generally 1 to 1 6 carbons, and are straight or 
branched. Alkenyl carbon chains of from 2 to 20 carbons typically contain 1 to 
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8 double bonds, and the alkenyl carbon chains of 2 to 16 carbons and typically 
contain 1 to 5 double bonds. Alkynyl carbon chains of from 2 to 20 carbons 
typically contain 1 to 8 triple bonds, and the alkynyl carbon chains of 2 to 1 6 
carbons and generally contain 1 to 5 triple bonds. Exemplary alkyi, alkenyl and 
5 alkynyl groups herein include, but are not limited to, methyl, ethyl, propyl, 

isopropyl, isobutyl, n-butyl, sec-buty), tert-butyl, isopentyl, neopentyl, tert-pentyl 
and isohexyl. The alkyl, alkenyl and alkynyl groups, unless otherwise specified, 
optionally can be substituted, with one or more groups, generally aikyl group 
substituents that are the same or different. As used herein, lower alkyI, lower 

10 alkenyl, and lower alkynyl refer to carbon chains having less than about 6 

carbons. As used herein, "atk(en)(yn)yl" refers to an alkyI group containing at 
least one double bond and at least one triple bond. 

As used herein, "cycloalkyi" refers to a saturated mono- or multicyclic 
ring system, typically 3 to 1 0 carbon atoms, such as, for example, 3 to 6 

15 carbon atoms; cycloalkenyi and cycloalkynyl refer to mono- or multicyclic ring 
systems that respectively Include at least one double bond and at least one triple 
bond. Cycloalkenyi and cycloalkynyl groups contain, for example, 3 to 10 
carbon atoms, with cycloalkenyi groups generally containing 4 to 7 carbon 
atoms and cycloalkynyl groups that contain, for example 8 to 10 carbon atoms. 

20 The ring systems of the cycloalkyi, cycloalkenyi and cycloalkynyl groups can be 
composed of one ring or two or more rings which can be joined together in a 
fused, bridged or spiro-connected fashion, and optionally can be substituted 
with one or more alkyI group substituents. "Cycloalkienllynlyl" refers to a 
cycloalkyi group containing at least one double bond and at least one triple 

25 bond. 

As used herein, "substituted alkyl," "substituted alkenyl," "substituted 
alkynyl," "substituted cycloalkyi," "substituted cycloalkenyi," and "substituted 
cycloalkynyl" refer to alkyI, alkenyl, alkynyl, cycloalkyi, cycloalkenyi and 
cycloalkynyl groups, respectively, that are substituted with one or more 
30 substituents, in certain embodiments one to three substituents, independently 
selected from alkyI, halo, haloalkyl, such as halo lower alkyI, pseudohalo, aryl, 
amino, dialkylamino, nitro, cyano, azido, alkylsulfinyl, alkylsulfonyl. 
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alkylcarbonylamino, aikoxycarbonylamino, aminoimino, hydroxy, alkoxy, aryloxy, 
aikyloxy, alkylthio, arylthio, aralkyloxy, aralkylthio, carboxy, alkylcarbonyl, 
alkoxycarbonyl, oxo and cycloalkyl. 

As used herein, "aryl" refers to cyclic groups containing from 6 to 1 9 
5 carbon atoms. Ary! groups include, but are not limited to groups, such as 
fluorenyl, substituted fluorenyl, phenyl, substituted phenyl, naphthyl and 
substituted naphthyl. As used herein, "aryl" also refers to aryl-containing 
groups, including, but not limited to, aryloxy, arylthio, arylcarbonyl and 
arylamino groups. 

10 As used herein, "heteroaryl" refers to a monocyclic or multicyclic 

aromatic ring system, generally about 5 to about 1 5 members where one or 
more, such as 1 to 3 of the atoms in the ring system is a heteroatom, that is, an 
element other than carbon, for example, nitrogen, oxygen and sulfur atoms. 
The heteroaryl group optionally can be fused to a benzene ring. Exemplary 

15 heteroaryl groups include, for example, furyl, imidazolyl, pyrrolidinyl, pyrimidinyl, 
tetrazolyl, thienyl, pyridyl, pyrrolyl, N-methylpyrrolyl, quinolinyl and isoquinolinyl, 
with pyridyl, thienyl and quinolinyl as examples thereof. 

As used herein, "heteroaryl" also refers to heteroaryl-containing groups, 
including, but not limited to, heteroaryloxy, heteroarylthio, heteroarylcarbonyl 

20 and heteroarylamino. 

As used herein, "heterocyclyl" refers to a monocyclic or multicyclic non- 
aromatic ring system, such as systems of 3 to 10 members, for exmaple 4 to 7 
members or 5 to 6 members, where one or more, such as 1 to 3 of the atoms in 
the ring system is a heteroatom, that is, an element other than carbon, for 

25 example, nitrogen, oxygen and/or sulfur atoms. 

As used herein, "substituted aryl," "substituted heteroaryl" and 
"substituted heterocyclyl" refer to aryl, heteroaryl and heterocyclyl groups, 
respectively, that are substituted with one or more substltuents, in certain 
embodiments one to three substituents, independently selected from alkyi, 

30 cycloalkyl, cycloalkylalkyi, aryl, heteroaryl optionally substituted with 1 or more, 
such as 1 to 3, substituents selected from halo, halo alkyl and alky!, aralkyi, 
heteroaralkyl, alkenyl containing 1 to 2 double bonds, alkynyl containing 1 to 2 
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tripie bonds, alk(en)(yn)yl groups, halo, pseudohalo, cyano, hydroxy, haloalkyi 
and polyhaloalkyi, such as halo lower alkyi, especially trifluoromethyl, formyl, 
alkylcarbonyl, arylcarbonyl that optionally is substituted with 1 or more, 
generally 1 to 3, substituents selected from halo, halo alky! and alkyI, 
5 heteroarylcarbonyl, carboxy, alkoxycarbonyl, aryloxycarbonyl, aminoimino, 

alkoxycarbonylamino, aryloxycarbonylamino, aminocarbonyl, alkylaminocarbonyl, 
dialkylaminocarbonyl, arylaminocarbonyl, diarylaminocarbonyl, aralkylaminocar- 
bonyl, alkoxy, aryloxy, perfluoroalkoxy, alkenyloxy, alkynyloxy, arylalkoxy, 
aminoalkyi, alkylaminoalkyi, dialkylanainoaikyi, arylaminoalkyi, amino, alkylamino, 
10 dialkylamino, arylamino, alkylarylamino, alkylcarbonylamino, arylcarbonylamino, 
azido, nitro, mercapto, alkylthio, arytthio, perfluoroalkylthio, thiocyano, 
isothiocyano, alkylsulfinyl, alkylsulfonyl, arylsulfinyl, arylsuifonyl, aminosulfonyl, 
alkylaminosulfonyl, dialkylaminosulfonyl and arylaminosuifonyl. 

As used herein, "aralkyl" refers to an alkyI group in which one of the 
15 hydrogen atoms of the alkyI is replaced by an aryl group. 

As used herein, "heteroaralkyl" refers to an alkyl group in which one of 
the hydrogen atoms of the alkyl is replaced by a heteroaryl group. 

As used herein^ the nomenclature alkyl, alkoxy, carbonyl, etc. is used as 
is generally understood by those of skill in this art. For example, as used herein 
20 alkyl refers to saturated carbon chains that contain one or more carbons; the 
chains can be straight or branched or include cyclic portions or be cyclic. 

Where the number of any given substituent is not specified {e.g., 
"haloalkyi"), there can be one or more substituents present. For example, 
"haloalkyi" can include one or more of the same or different halogens. As 
25 another example, "C^.aalkoxyphenyl" can include one or more of the same or 
different alkoxy groups containing one, two or three carbons. 

As used herein, "halo", "halogen" or "halide" refers to F, CI, Br or I. 

As used herein, pseudohalides are compounds that behave substantially 
similar to halides. Such compounds can be used in the same manner and 
30 treated in the same manner as halides (X', in which X is a halogen, such as CI or 
Br). Pseudohalides include, but are not limited to, cyanide, cyanate. 
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thiocyanate, selenocyanate, trifluoromethoxy, difluoromethoxy, dichloromethoxy 
and azide. 

As used herein, "haloalkyt" refers to a lower alkyi radical in which one or 
more of the hydrogen atoms are replaced by halogen. Such groups include, but 
5 not limited to, chloromethyl, trifluoromethyl, 1-chIoro-2-fluoroethyl and the like. 
As used herein, "haloalkoxy" refers to RO- in which R is a haloalkyi 

group. 

As used herein, "sulfinyl" or "thionyl" refers to -S(0)-. As used herein, 
"sulfonyl" or "sulfuryl" refers to -SfOj-. As used herein, "sulfo" refers to - 
10 SfOjO-. 

As used herein, "carboxy" refers to a divalent radical, -C(0)0-. 

As used herein, "aminocarbonyl" refers to -C(0)NH2. 

As used herein, "alkylaminocarbonyr refers to -C(0)NHR in which R is 
hydrogen or alkyI, such as, for example, lower alkyl. 
15 As used herein "dialkylaminocarbonyl" as used herein refers to-C(0)NR'R 

in which R' and R are independently selected from hydrogen or alkyl, such as, 
for example, lower alkyl; "carboxamide" refers to groups of formula -NR'COR. 

As used herein, "diarylaminocarbonyl" refers to -C(0)NRR' in which R and 
R' are independently selected from aryl, such as lower aryl, for example, phenyl. 
20 As used herein, "aralkylaminocarbonyl" refers to -C{0)NRR' in which one 

of R and R' is aryl, such as, lower aryl, for example, phenyl, and the other of R 
and R' is alkyl, such as, for example, lower alkyl. 

As used herein, "arylaminocarbonyr refers to -C(0)NHR in which R is 
aryl, such as lower aryl, for example, phenyl. 
25 As used herein, "hydroxycarbonyl" refers to -COOH. ^ 

As used herein, "alkoxycarbonyl" refers to -C(0)OR in which R is alkyl, 
such as lower alkyl. 

As used herein, "aryloxycarbonyl" refers to -C(0)OR in which R is aryl, 
such lower aryl, for example phenyl. 
30 ■ As used herein, "alkoxy" and "alkylthio" refer to RO- and RS-, in which R 

is alkyl, such as, for example, lower alkyt. 
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As used herein, "aryloxy" and "arylthio" refer to RO- and RS-, in which R 
is aryl, such lower aryl, for example, phenyl. 

As used herein, "alkylene" refers to a straight, branched or cyclic, such 
as, for example, straight or branched, divalent aliphatic hydrocarbon group, for 
5 example, having from 1 to about 20 carbon atoms such as 1 to 1 2 carbons, 
and for exmaple, is lower alkylene. There optionally can be inserted along the 
alkylene group one or more oxygen, sulphur or substituted or unsubstituted 
nitrogen atoms, where the nitrogen substituent is alkyi as previously described. 
Exemplary alkylene groups include methylene {-CH^-), ethylene (-CH2CH2-), 
10 propylene {— (CHala-), cyclohexylene {-CqH^q-), methylenedioxy (-O-CH2-O-) and 
ethylenedioxy (-0-{CH2)2-0-). The term "lower alkylene" refers to alkylene 
groups having 1 to 6 carbons. Exemplary alkylene groups are lower alkylene, 
such as, for example, alkylene of 1 to 3 carbon atoms. 

As used herein, "alkenylene" refers to a straight, branched or cyclic, 

1 5 typically straight or branched, divalent aliphatic hydrocarbon group, such as, for 
example, having from 2 to about 20 carbon atoms and at least one double bond, 
generally 1 to 1 2 carbons, and is for example, lower alkenylene. There 
optionally can be inserted along the alkenylene group one or more oxygen, 
sulphur or substituted or unsubstituted nitrogen atoms, where the nitrogen 

20 substituent is alkyI as previously described. Exemplary alkenylene groups 

include — CH = CH— CH = CH— and -CH = CH-CH2-- The term "lower alkenylene" 
refers to alkenylene groups having 2 to 6 carbons. Examplary alkenylene groups 
are lower alkenylene, such as, for example, alkenylene of 3 to 4 carbon atoms. 
As used herein, "alkynylene" refers to a straight, branched or cyclic, 

25 generally straight or branched, divalent aliphatic hydrocarbon group, such those 
having from 2 to about 20 carbon atoms and at least one triple bond, generally 
1 to 1 2 carbons, such as, for example, lower alkynylene. There optionally can 
be inserted along the alkynylene group one or more oxygen, sulphur or 
substituted or unsubstituted nitrogen atoms, where the nitrogen substituent is 

30 alkyI as previously described. Exemplary alkynylene groups include 

— CsC— CbC— ,-CaC-and -CsC-CHj-. The term "lower alkynylene" refers to 
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alkynytene groups having 2 to 6 carbons. Exemplary aikynylene groups are 
lower aikynylene, such as, for example, aikynylene of 3 to 4 carbon atoms. 

As used herein, "aik(en)(yn)ylene" refers to a straight, branched or cyclic, 
generally straight or branched, divalent aliphatic hydrocarbon group, having, for 
5 example, from 2 to about 20 carbon atoms and at least one triple bond, and at 
least one double bond; typically 1 to 1 2 carbons, such as, for example, lower 
a!k{en){yn)ylene. There optionally can be inserted along the aikynylene group 
one or more oxygen, sulphur or substituted or unsubstituted nitrogen atoms, 
where the nitrogen substituent is alkyi as previously described. Exemplary 

10 alkten)(yn)ylene groups include — C = C— (CH2)n-C = C — , where n is 1 or 2. The 
term "lower alk(en)(yn)ylene" refers to alk(en)(yn)ylene groups having up to 6 
carbons. Exemplary alk(en)(yn)ylene groups are lower alk{en){yn)ylene, such as, 
for example, alk(en)(yn)ylene of 4 carbon atoms. 

As used herein, "cycloalkylene" refers to a divalent saturated mono- or 

15 multicyclic ring system, generally 3 to 10 carbon atoms, such as 3 to 6 carbon 
atoms; cycloalkenylene and cycloalkynylene refer to divalent mono- or 
multicyclic ring systems that respectively include at least one double bond and 
at least one triple bond. Cycloalkenylene and cycloalkynylene groups can 
contain 3 to 1 0 carbon atoms, with, for example, cycloalkenylene groups 

20 containing 4 to 7 carbon atoms and cycloalkynylene groups containing 8 to 10 
carbon atoms. The ring systems of the cycloalkylene, cycloalkenylene and 
cycloalkynylene groups can be composed of one ring or two or more rings that 
can be joined together in a fused, bridged or spiro-connected fashion. 
"Cycloalk(en)(yn)ylene" refers to a cycloalkylene group containing at least one 

25 double bond and at least one triple bond. 

As used herein, "substituted alkylene," "substituted alkenylene," 
"substituted aikynylene," "substituted cycloalkylene," "substituted cyclo- 
alkenylene," and "substitued cycloalkynylene" refer to alkylene, aikenylene, 
aikynylene, cycloalkylene, cycloalkenylene and cycloalkynylene groups, 

30 respectively, that are substituted with one or more substituents, in certain 
embodiments one to three substituents, independently selected from halo, 
haloalkyi, such as, for example, halo lower alkyl, aryl, hydroxy, alkoxy, aryloxy. 
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alkyloxy, alkylthio, arylthio, aralkyloxy, aralkylthio, carboxy alkoxycarbonyl, oxo 
and cycloalkyl. 

As used herein, "arylene" refers to a monocyclic or polycyclic, such as 
monocyclic, divalent aromatic group, for example, having from 5 to about 20 
5 carbon atoms and at least one aromatic ring, such as 5 to 12 carbons, and, is, 
for example, lower arylene. There optionally can be inserted around the arylene 
group one or more oxygen, sulphur or substituted or unsubstituted nitrogen 
atoms, where the nitrogen substituent Is alkyi as previously described. 
Exemplary arylene groups include 1,2-, 1,3- and 1 ,4-phenylene. The term 

10 "lower arylene" refers to arylene groups having 5 or 6 carbons. Exemplary 
arylene groups are lower arylene. 

As used herein, "heteroarylene" refers to a divalent monocyclic or 
multicych'c aromatic ring system, such as of about 5 to about 1 5 members 
where one or more, typically, for example, 1 to 3 of the atoms In the ring 

15 system is a heteroatom, that is, an element other than carbon, for example, 
nitrogen, oxygen and/or sulfur atom{s). 

As used herein, "heterocyclylene" refers to a divalent monocyclic or 
multicyclic non-aromatic ring system, generally of 3 to 1 0 members, such as, for 
example, 4 to 7 members or 5 to 6 members, where one or more, such as, for 

20 example, 1 to 3 of the atoms in the ring system is a heteroatom, that is, an 

element other than carbon, for example, nitrogen, oxygen and/or sulfur atom{s). 

As used herein, "substituted arylene," "substituted heteroarylene" and 
"substituted heterocyclylene" refer to arylene, heteroarylene and heterocyclylene 
groups, respectively, that are substituted with one or more substituents, in 

25 certain embodiments one to three substituents, independently selected from 

alkyl, cycloalkyl, cycloalkylalkyl, aryl, heteroaryl optionally substituted with 1 or 
more, such as 1 to 3, substituents selected from halo, halo alkyl and alkyl, 
aralkyi, heteroaralkyl, alkenyl containing 1 to 2 double bonds, alkynyl containing 
1 to 2 triple bonds, alk(en)(yn)yl groups, halo, pseudohalo, cyano, hydroxy, 

30 haloalkyl and polyhaloalkyi, such as, halo lower alkyl, for example 

trifluoromethyl, formyl, alkylcarbonyl, arylcarbonyl that optionally is substituted 
with 1 or more, such as 1 to 3, substituents selected from, for example, halo, 
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halo alkyi and alkyl, heteroarylcarbonyl, carboxy, alkoxycarbonyl, 
aryloxycart>onyl, aminocarbonyl, alkylaminocarbonyl, dialkylaminocarbonyl, 
arylaminocarbonyl, diarylaminocarbonyl, aralkylaminocarbonyl, alkoxy, aryloxy, 
perfluoroalkoxy, alkenyloxy, alkynyloxy, arylalkoxy, aminoalkyl, alkytaminoalkyl, 
5 dialkylaminoalkyl, arylaminoalkyi, amino, aikylamino, dialkylamino, arylamino, 
alkylarylamino, alkylcarbonylamino, arylcarbonylamino, azido, nitro, mercapto, 
alkylthio, arylthio, perfluoroalkylthio, thiocyano, isothiocyano, alkylsulfinyl, 
alkylsulfonyl, arylsulfinyl, arylsulfonyl, aminosulfonyl, alkylaminosulfonyl, 
diaikylaminosulfonyl and arylaminosulfonyl. 

10 As used herein, "alkylidene" refers to a divalent group, such as =CR'R", 

which is attached to one atom of another group, forming a double bond. 
Exemplary alkylidene groups are methylidene {-CH2) and ethylidene (^CHCHj). 
As used herein, "aralkylidene" refers to an alkylidene group in which either R' or 
R" is an aryl group. "Cycloalkylidene" groups are those where R' and R" are 

15 linked to form a carbocyclic ring, "Heterocyclylidene" groups are those where at 
least one of R' and R" contain a heteroatom in the chain, and R' and R" are 
linked to form a heterocyclic ring. 

As used herein, "amido" refers to the divalent group -C(0)NH-. 
"Thioamido" refers to the divalent group -C(S)NH-. "Oxyamido" refers to the 

20 divalent group -OC{0)NH-. "Thiaamido" refers to the divalent group -SC(0)NH-. 
"Dithiaamido" refers to the divalent group -SC{S)NH-. "Ureido" refers to the 
divalent group -HNC(0)NH-. "Thioureido" refers to the divalent group - 
HNC(S)NH-. 

As used herein, "semicarbazide" refers to -NHC{0)NHNH-. "Carbazate" 
25 refers to the divalent group -OC(0)NHNH-. "Isothiocarbazate" refers to the 
divalent group -SC(0)NHNH-. "Thiocarbazate" refers to the divalent group - 
OC(S)NHNH-. "Sulfonylhydrazide" refers to the group -SO2NHNH-. "Hydrazide" 
refers to the divalent group -C<0)NHNH-. "Azo" refers to the divalent group 
-N = N-. "Hydrazinyr refers to the divalent group -NH-NH-. 
30 As used herein, the term "amino acid" refers to a-amino acids which are 

racemic, or of either the D- or L-configuration. The designation "d" preceding an 
amino acid designation {e.g., dAla, dSer, dVal, etc.) refers to the D-isomer of the 
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amino acid. The designation "dl" preceding an amino acid designation [e.g., 
dIPIp) refers to a mixture of the L- and D-isomers of the amino acid. 

As used herein, when any particular group, such as phenyl or pyridyl, is 
specified, this means that the group is unsubstituted or is substituted. 
5 Exemplary substituents where not specified are halo, halo lower allcyl, and lower 
alkyt. 

As used herein, the abbreviations for any protective groups, amino acids 
and other compounds, are, unless indicated otherwise, in accord with their 
common usage, recognized abbreviations, or the lUPAC-lUB Commission on 
10 Biochemical Nomenclature (see, (1972) Biochem. 7 7:942-944). 

As used herein, HHT and CHT refer to h e xahyd rot y rosy 1 (also known as 
cyclohexyltyrosyi or p-hydroxycyclohexylalanyl), CHA is cyclohexylalanyl, Pyr 
and pyroGIu refer to pyroglutamic acid, Pip is pipecolinic acid, Sar is sarcosine, 
nLeu and Nle are norleucine, nVal is norvaline, Aib is 2-aminoisobutyric acid, 
1 5 Quat is (R)-Glu(or-(3-amidinobenzyl)), and Abu and But are 2-aminobutyric acid. 
As used herein, PEG represents a polyethylene glycol containing 
substituent having the designated number of ethyleneoxy subunits. Thus, the 
term PEG(2) represents: 



20 



O 




25 and the term PEG(6) represents: 



30 




When R' and are combined to form -[CH^)^-, the cyclic moieties and 
heteroatom-containing cyclic moieties so defined include, but are not limited to: 
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5 



10 



15 



20 




the 



term "hydroxylated" represents substitution on a substitutable carbon of the ring 
system being so described by a hydroxy! moiety. As used herein, the term "poly- 
25 hydroxyJated" represents substitution on two or more substitutable carbons of 
the ring system being so described by 2, 3 or 4 hydroxy! moieties. 

As used herein, the term "(d)(2,3-dihydroxypropionyl)" represents the 
following structure: 




o 

35 



40 
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10 



As used herein, the term "{2R,3S)-2,3,4-trihvdroxybutanoy!" represents 
the following structure: 
OH O 

HO 



OH 



As used herein, the term "quinyf" represents the following structure: 
O 




15 



HOlt 



HOi' 



HO 




OH 



20 or a diastereomer thereof. 

As used herein, the term "gulonyl" represents the following structure: 
O 

u 



2B O 



HO 




OH 
OH 



30 



35 



or a diastereomer thereof. 

As used herein, the term "cotinlnyl" represents the following structure: 




40 H3C-N 




45 
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5 

or a diastereomer thereof. 

As used herein, the term "gallyl" represents the following structure: 
O 




As used herein, the term "4-ethoxvsquaryr represents the following 
structure: 



20 



EtO 



/ \ 




25 



O 



O 



As used herein, 1-methylHis or (1Me)H refers to the structure: 



30 



35 




As used herein, 3-methylHis or (3Me)H refers to the structure: 



40 



45 
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-|-NH C{0)-|- 



10 



As used herein, Quat^ refers to: 
CN 



COOMe 



-j-NH C(0)-|- 



15 Quat^ refers to: 



NH 



20 




25 



f 



NH 




NK 



COOMe 



30 Quat* refers to: 



35 




Me 



40 



f 



NH 




COOMe 



45 
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; and 



1 




15 

Other abbreviations as used herein are as follows: 



Abbreviation Refers to 

Aib 2-aminoisobutyryl 

20 4,4-dimethylThr 2-annino-3-hydroxy-4-methylpentanoyl 

Met(02) nnethioninyl-S,S-dioxide 

SeriOIVIe) the O-methyl ether of serinyl, also known as 2- 

annino-3-nnethoxypropanoyl 

hSer homoserinyl, also known as 2-amino-4- 

25 hydroxybutanoyl 

(hS)Gly N-(2-hydroxyethyl)glycyl 

IM,N-dimethvlGIy N,N-dimethylglycyl 

)9-Ala 3-aminopropanoyl 

Cys(Me) S-methylcysteinyl 

30 t-butyl6Iy 2-amino-3,3-dimethylbutanoyl 

F{Gn) 4-guanldinylphenylalanyl 

hCHA homocyclohexylalanyl, or 2-amino-4- 

cyclohexylbutanoyl 

hexylGly 2-anninooctanoyl 

35 allylGly 2-amino-4-pentenoyl 

Inact. inactive 

NT not tested 
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10 



15 



20 



25 



30 



MeOEtCO 

3,4-MethyldioxyPhAc 

L-3-PhLactyl 

MeOEtOCO 

MeOCO 

MeO(EtO)2Ac 

2-PyridylAc 

PhOAc 

MeOAc 

PhAc 

MeOEtOAc 
HOOCButa 
2 

EtOCO 

M 

NapAc 

iBoc 

HOAc 

MeSucc 

Succ 

HCO 

4-{guan)Phg 

Dox 

Tax 

dA(Chx) or dCha 

dhF 

P{OH) 



3-nnethoxypropanoyl 

3,4-nnethylenedioxyphenylacetyl 

L-2-hydroxy-3-phenylpropanoyl 

2-nnethoxyethoxycarbonyI 

methoxycarbonyl 

2-{2-methoxyethoxy)ethoxyacetyl 

2-pyridylacetyl 

phenoxyacetyl 

methoxy acetyl 

phenylacety] 

2- methoxyethoxyacetyl 
glutaryl 

benzyloxycarbonyl 
ethoxycarbonyl 

beta-aianyl or 3~aminopropanoyl 
1-naphthylacetyl 
isobutoxycarbonyl 
hydroxyacetyl 

3- methoxycarbonylpropanoyl 
succinyl 

formyl 

4- guanidinylphenylglycyl 
doxorubicin 

taxol 

d-cyclohexylalanyl 
d-homophenylalanyl 
4-hydroxy prolyl 



B. Protease targets 

The conjugates herein are designed to target proteases that are located 
on cell surfaces, particularly tumor cells and cells involved in tumorigenic 
processes and angiogenesis and other proliferative processes. The conjugates, 
described in detail below, contain a peptidic substrate for a selected targeted 
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cell surface protease linked, either directly or via a linker, to a therapeutic agent, 
typically a cytotoxic agent, which is substantially inactive when in the 
conjugate. The therapeutic agent is released in a form that is active or that can 
be activated in the vicinity of the targeted cell or tissue to which it is delivered. 
5 As a result, active therapeutic agent accumulates at the targeted cells or tissue 
or in the targeted cells. 

The targeted protease is selected by identifying a protease that is located 
on a cell or tissue (or associated therewith) that is involved in the disease 
process or serendipitously present in the locale of cells or tissues involved in the 

10 disease or disease process, and, generally, is not located at all or present or 
active at lower levels, generally substantially lower levels, or exhibits altered 
activity or specificity, on many, if not all, other cells or tissues. The variety and 
numbers of non-targeted cells or tissues that expresss the active protease varies 
for particular proteases and diseases intended for treatment. Those of skill in 

1 5 the art will select a target based upon the disease, targeted agents and tolerable 
or acceptable levels of side-effects. The goal is to achieve enhanced therapeutic 
index compared with administration of the targeted agent by itself. 

The targeted protease may or may not be involved in the disease process 
and its expression can be serendiptous; for purposes herein its particular rote or 

20 lack thereof is not important; it is the fact that it is active in the locale of 

targeted tissues or cells that is important. For example, many of the cell surface 
proteases of interest herein are expressed or active on tumor cells or cells 
involved in the tumorigenic processes. Any method known to one of skill in the 
art for determining or detecting a tissue or cell expression profile can be used. 

25 For example, RIMA blots composed of RNA from numerous tissues {e.g., a 
multiple tissue expression (MTE) array available from CLONTECH, Palo Alto, 
CA), can be screened with probes based upon the nucleic acid sequence of the 
protease of interest to identify cells that express the protease. Northern 
analysis of the blots to test for expression also can be used. 

30 Included among the targeted proteases are those designated type II 

membrane-bound serine proteases {MTSPs; see, e.g., U.S. application Serial No. 
09/776,191, filed February 2, 2001 and International PCT application No. 
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PCT/US01/03471 published as International PCT application No. WO 01/57194; 
see International PCT application No. PCT/US02/07903; see, also U.S. 
provisional application Serial Nos. 60/275,592, 60/278,166, 60/279,228, 
60/291,001, 60/291,501 60/316,818, 60/302,939, 60/316,818, 60/328,529, 
5 60/328,530, 60/332,015, 60/328,939, and provisional application, filed on 
May 20, 2002 under attorney docket no. 24745-P1624; U.S. application Serial 
Nos. 10/099,700, 10/104,271, 10/112,221, application filed on May 14, 2002 
under attorney docket no. 24745-1616) and those found on endothelial cells 
designated endotheliases (see, U.S. application Serial No. 09/717,473, filed 

10 November 20, 2000, and International PCT application No. PCT/USOO/31 803 
published as International PCT application No. WO 01/36604); see, also SEQ ID 
Nos. 3-26, 269-270 and 272-276. 

Also contennplated are proteases that are located at the cell surface by 
virtue of a specific interaction with a cell surface protein. Urokinase 

15 plasminogen activator (u-PA) bound to urokinase plasminogen activator receptor 
(u-PAR) is exemplary of such proteases. Nucleic acid sequence information and 
expression profiles of exemplary MTSPs and endotheliases are as follows (see, 
also EXAMPLE 6). 

1 . MTSPs 

20 Cell surface proteolysis is a mechanism for the generation of biologically 

active proteins that mediate a variety of cellular functions. These membrane- 
anchored proteins, include a disintegrin-like and metalloproteinase (ADAM) and 
membrane-type matrix metalloproteinase (MT-MMP). In addition to the MMPs, 
serine proteases have been implicated in neoplastic disease progression. Most 

25 serine proteases, which are either secreted enzymes or are sequestered in 

cytoplasmic storage organelles, have roles in blood coagulation, wound healing, 
digestion, immune responses and tumor invasion and metastasis. 

Transmembrane serine proteases (MTSPs) appear to be involved in the 
etiology and pathogenesis of tumors. These enzymes are expressed in certain 

30 cancerous and tumor cells and in other cells associated with other proliferative 
disorders and other disease states, such as in inflammatory cells and and can be 
tissue or organ-specific. In mammals, more than 20 members of the family are 
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known (see. Hooper et al. (2001) J. Biol. Chem. 276:857-860, see, also U.S. 
application Serial No. 09/776,191, filed February 2, 2001 and International PCT 
application No. PCT/US01 /03471 ; see, also U.S. provisional application Serial 
Nos. 60/275,592 and 60/278,166; and see SEQ ID Nos. 1-37). These include 
5 corin (accession nos. API 33845 and AB013874; see, Yan era/. (1999) J. BioL 
Chem. 274:14926-14938; Tomia et ai (1998) J. Biochem. 724:784-789; Uan 
etal. (2000) Proc. Nat/. Acad. Sc/. U.S.A. 57:8525-8529); enterpeptidase (also 
designated enterokinase; accession no. U09860 for the human protein; see, 
Kitamoto et af. (1995) Biochem. 27: 4562-4568; Yahagi eta/. (1996) Biochem. 
10 Biophys. Bes. Common. 2/9:806-812; Kitamoto eta/. (1994) Proc. Natl. Acad. 
Set. U.S.A. 97:7588-7592; Matsushima eta/. (1994) J. Biol. Chem. 
255:19976-19982;); human airway trypsin-like protease (HAT; accession no. 
AB0021 34; see Yamaoka et al. J. Bio/. Chem. 273:1 1 894-1 1 901 ); MTSP1 
(also called TADG-15 and matriptase, see SEQ ID Nos. 1 and 2; accession nos. 
15 API 33086/AF1 18224, AP04280022; Takeuchi eta/. (1999) Proc. Nat/. Acad. 
Sci. U.S.A. 95:11054-1161; Lin etal. (1999) J. Biol. Chem. 274:18231-18236; 
Takeuchi etal. (2000) J. B/oI. Chem. 275:26333-26342; and Kim etal. (1999) 
Immunogenetics 49:420-429); hepsin (see, accession nos. Ml 8930, 
AP030065, X70900; Leytus etal. (1988) Biochem. 27: 11395-11901; Vu et 
20 al. (1997) J. Biol. Chem. 272:31315-31320; and Farley eta/. (1993) Biochem. 
Biophys. Acta 7 775:350-352; and see, U.S. Patent No. 5,972,616); TMPRS2 
(see. Accession Nos. U75329 and AF1 13596; Paoloni-Giacobino eta/. (1997) 
Genomics 44:309-320; and Jacquinet et a/. (2000) FEBS Lett. 468: 93-1 00); 
and TMPRSS4 (see, Accession No. NM 016425; Wallrapp eta/. (2000) Cancer 
25 50:2602-2606). Also known MTSP3, MTSP4, MTSP6, MTSP7, MTSP9, 

MTSP10, MTSP12. MTSP20, MTSP22 and MTSP25 (see, SEQ ID NOs. 3-26, 
269-270 and 272-276; see, also U.S. application Serial No. 09/776,191, filed 
February 2, 2001 and International PCT application No. PCT/US0 1/03471 
published as International PCT application No. WO 01/57194; see International 
30 PCT application No. PCT/US02/07903; see, also U.S. provisional application 
Serial Nos. 60/275,592, 60/278,166, 60/279,228, 60/291,001, 60/291,501 
60/316,818, 60/302,939, 60/316,818, 60/328,529, 60/328,530, 
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60/332,015, 60/328,939, and provisional application, filed on May 20 2002, 
under attorney docket no. 24745-P1624; U.S. application Serial Nos. 
10/099,700, 10/104,271, 10/112,221, application filed on May 14, 2002 
under attorney docket no. 24745-1616)). 
5 Serine proteases, including transmembrane serine proteases, have been 

implicated in processes involved in neoplastic development and progression. 
While the precise role of these proteases has not been elaborated, serine 
proteases and inhibitors thereof are involved in the control of many intra- and 
extracellular physiological processes, including degradative actions in cancer celt 

10 invasion, metastatic spread, and neovascularization of tumors, that are involved 
in tumor progression. It is believed that proteases are involved in the 
degradation of extracellular matrix (ECM) and contribute to tissue remodeling, 
and are necessary for cancer invasion and metastasis. The activity and/or 
expression of some proteases have been shown to correlate with tumor 

15 progression and development, and also are shown to be active in specific cell 
types. 

For example, a membrane-type serine protease MTSP1 (also called 
matriptase; see SEQ ID Nos. 1 and 2 from U.S. Patent No. 5,972,616; and 
GenBank Accession No. AF1 18224; (1999) J. BioL Chem. 274:18231-18236; 

20 U.S. Patent No. 5,792,616; see, also Takeuchi (1999) Proc. Natl. Acad. Sc/. 
U.S.A. 55:1 1054-1 161) that is expressed in epithelial cancer and normal tissue 
(Takeucuhi eta/. (1999) Proc. Natl. Acad. Set. USA 55:1 1054-61) has been 
identified. It has been proposed that it plays a roie in the metastasis of breast 
cancer. Its primary cleavage specificity is Arg-Lys residues. Matriptase also is 

25 expressed in a variety of epithelial tissues with high levels of activity and/or 
expression in the human gastrointestinal tract and the prostate. 

Hepsin, a cell surface serine protease identified in hepatoma cells, is 
overexpressed in ovarian cancer (Tanimoto et al. (1997) Cancer Res., 
57:2884-7). The hepsin transcript appears to be abundant in carcinoma tissue 

30 and is almost never expressed in normal adult tissue, including normal ovary. It 
has been suggested that hepsin is frequently overexpressed in ovarian tumors 
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and therefore can be a candidate protease in the invasive process and growth 
capacity of ovarian tumor cells. 

A serine protease-like gene, designated normal epithelial cell-specific 1 
(NES1) {Liu etai (1996) Cancer Res. 56:3371-9) has been identified. Although 
5 expression of the NES1 mRNA is observed in all normal and immortalized 

nontumorigenic epithelial cell lines, the majority of human breast cancer cell lines 
show a drastic reduction or a complete lack of its expression. The structural 
similarity of NES1 to polypeptides known to regulate growth factor activity and 
a negative correlation of NES1 expression with breast oncogenesis suggest a 
10 direct or indirect role for this protease-like gene product in the suppression of 
tumorigenesis. 

Exemplary fWTSPs 
Each MTSP has a characteristic tissue expression profile; the MTSPs in 
particular, although not exclusively expressed or activated in tumors, exhibit 
1 5 characteristic tumor tissue expression or activation profiles. In some instances, 
MTSPs can have different activity in a tumor cell from a non-tumor cell by virtue 
of a change in a substrate or cof actor therefor or other factor that would alter 
functional activity of the MTSP. Hence each can serve as a diagnostic marker 
for particular tumors, by virtue of a level of activity and/or expression or 
20 function In a subject (i.e. a mammal, particularly a human) with neoplastic 
disease, compared to a subject or subjects that do not have the neoplastic 
disease. In addition, detection of activity (and/or expression) in a particular 
tissue can be indicative of neoplastic disease. Also, by virtue of the activity 
and/or expression profiles of each, they can serve as therapeutic targets, such 
25 as by administration of modulators of the activity thereof, or, as by 

administration of a prodrug specifically activated by one of the MTSPs. Each or 
any of the MTSPs can exhibit activity or expression levels or substrate 
specificities that differ in tumor cells from the levels in normal cells. Such tumor 
cells include, but are not limited to, colon, lung, prostate, breast, esophagous, 
30 pancreas, cervic, uterus, endometrium, and other solid tumors and in blood and 
lymphatic tumors. Hence, conjugates provided herein can be designed by 
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selection of substrate specificity for treatment of any of sucii tumors and 
neoplastic conditions. 

Tissue expression profiles 
The following are exemplary tissue and gene (see also, EXAMPLE 8) 
5 profiles of some exemplary MTSPs. These profiles are not intended to define 
the full scope of expression or activation of these MTPSs, but demonstrate that 
MTSPs are expressed in tumors, and, hence there expression or activation or 
substrate specificity on the surface of tumor cells can be exploited in the 
methods herein and conjugates, designed in accord with the methods herein and 

10 as exemplified herein, that are cleaved by one or more of these MTSPs can be 
prepared and employed for treatment of neoplastic or other diseases or 
conditions or to target to cells that express these proteins on there surfaces. 

MTSPI (matriptase) 
MTSPl (also called matriptase) is a trypsin-lilce serine protease with 

15 broad spectrum cleavage activity and two potential regulatory modules. It was 
named "matriptase" based on its ability to degrade the extra-cellular matrix and 
its trypsin-like activity. When isolated from breast cancer cells {or T-47D cell 
conditioned medium), MTSPI has been reported to be primarily in an 
uncomplexed form. MTSPI has been isolated from human milk; when isolated 

20 from human milk, it was reported to be in one of two complexed forms, 95 kDa 
(the predominant form) and 110 kDa; uncomplexed MTSPI was not detected 
(Liu, etal. (1999) J. Biol. Chem. 274:18237-18242). It has been proposed that 
MTSPI exists as an uncomplexed protease when in its active state. In breast 
milk, it has been reported to exist in complex with a fragment of hepatocyte 

25 growth factor inhibitor-1 (HAI-1), a Kunitz-type serine protease inhibitor having 
activity against trypsin-like serine proteases. 

Nucleic acids encoding the protein designed matriptase were cloned from 
T-47D human breast cancer cell-conditioned medium (Lin et al. (1999) J. Biol. 
Chem. 274:1 823^ -18236). Upon analysis of the cDNA, it was determined that 

30 the full length protease has 683 amino acids and contains three main structural 
regions: a serine protease domain near the carboxyl-terminal region, four 
tandem low-density lipoprotein receptor domains, and two tandem complement 
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subcomponents Clr and CIs (see SEQ ID No. 1). Studies to identify additional 
serine proteases made by cancer cells were done using PC-3 cells. A serine 
protease termed "MT-SP1 " (MTSP1 ) by the authors, reported to be a 
transmembrane protease was cloned (Takeuchi era/. (1999) Proc, Nat!. Acad. 
5 Sci. U.S.A. 96:1 1054-1 1061). It was subsequently found that originally 
identified matriptase sequence is included in the translated sequence of the 
cDNA that encodes MTSP1 . The nucleic acid encoding the protein originally 
designated matriptase is a partial MTSP1 clone that lacks 516 of the coding 
nucleotides (Takeuchi, era/., J. Biol. Chem 275:26333-26342 (2000).) Since 
10 the reported matriptase encoding cDNA sequence encoded a possible initiating 
methionine, it was proposed that alternative splicing could yield a protein lacking 
the N-terminal region of MTSP1 . Hence, matriptase herein is a variant form of 
MTSP1 . 

MTSPI demonstrates trypsin-like protease activity and is a Type II 
1 5 transmembrane protein with an extracellular protease domain. Studies of 

substrate specificity of MTSP1 reveal that protease-activated receptor 2 (PAR2), 
pro-hepatacyte growth factor (pro-HGF) and single-chain urokinase-type 
plasminogen activator (sc-uPA) are macromolecular substrates of MTSP1 , PAR2 
functions in inflammation, cyto protection and/or cell adhesion, while sc-uPa 
20 functions in tumor cell invasion and metastasis. HGF serves a growth and pro- 
angtogenic factor. 

An exemplary nucleotide sequence encoding a human MTSPl is set forth 
in SEQ ID Nos 1 and 2. As previously noted SEQ ID No. 1 sets for an MTSPl - 
encoding nucleic acid sequence. This sequence is the longer version and 
25 includes the protease domain, which is common to both variants. 

MTSPl is expressed in breast, prostate and colorectal tumors. Hence 
conjugates with substrates therefor can be used for treatment of such tumors, 

WiTSPS 

The MTSP3 transcript was detected in lung carcinoma (LX-1 ), colon 
30 adenocarcinoma (CX-1), colon adenocarcinoma (GI-1 12} and ovarian carcinoma 
(Gl-102). No apparent signal was detected in another form of lung carcinoma 
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(GI-117), breast carcinoma (GI-101), pancreatic adenocarcinoma (GI-103) and 
prostatic adenocarcinoma {PC3). 

MTSP4 

The MTSP4 transcript, a DNA fragment encoding part of the LDL receptor 
5 domain and the protease domain was used to probe an RNA blot composed of 
76 different human tissues (catalog number 7775-1; human multiple tissue 
expression (MTE) array; CLONTECH). As in the northern analysis of gel blot, a 
very strong signal was observed in the liver. Signals in other tissues were 
observed in (decreasing signal level): fetal liver > heart = kidney = adrenal 
10 gland - testis = fetal heart and kidney = skeletal muscle = bladder = 

placenta > brain = spinal cord = colon = stomach = spleen = lymph node = 
bone marrow = trachea = uterus = pancreas = salivary gland = mammary 
gland = lung. I\/1TSP4 also is expressed less abundantly in several tumor cell 
lines including HeLa S3 = leukemia K-562 = Burkitt's lymphomas (Raji and 

15 Daudi) = colorectal adenocarcinoma (SW480) > lung carcinoma (A549) = 

leukemia MOLT-4 = leukemia HL-60. PGR of the MTSP4 transcript from cDNA 
libraries made from several human primary tumors xenografted in nude mice 
(human tumor multiple tissue cDNA panel, catalog number K1 522-1, 
CLONTECH) was performed using MTSP4-specific primers. The MTSP4 

20 transcript was detected in breast carcinoma (GI-IOI), lung carcinoma (LX-1), 
colon adenocarcinoma (GI-1 12) and pancreatic adenocarcinoma (GI-103). No 
apparent signal was detected in another form of lung carcinoma (GI-1 17), colon 
adenocarcinoma (CX-1), ovarian carcinoma (GI-1 02). and prostatic 
adenocarcinoma (PCS). The IVITSP4 transcript was also detected in LNCaP and 

25 PC-3 prostate cancer cell lines as well as in HT-1080 human fibrosarcoma cell 
line. 

IVITSP6 

MTSP6 is expressed at high levels In the colon. It also is expressd in the, 
stomach, trachea, mammary gland, thyroid gland, salivary gland, pituitary gland 
30 and pancreas. It is expressed at lower levels in other tissues (see EXAMPLE 6). 
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IVITSP6 also is expressed in several tumor cell lines including HeLa S3 > 
colorectal adenocarcinoma (SW480) > leukemia MOLT-4 > leukemia K-562. In 
mouse xenograft models, the MTSP6 transcript was strongly detected in lung 
carcinoma moderately detected in pancreatic adenocarcinoma (GI-103), 

5 weakly detected in ovarian carcinoma (GI-102); and weakly detected in colon 
adenocarcinoma (GI-1 12 and CX-1), breast carcinoma {Gl-101), lung carcinoma 
(Gl-1 17) and prostatic adenocarcinoma (PCS). The MTSP6 transcript was also 
detected in breast cancer cell line IVIDA-MB-231, prostate cancer cell line PC-3, 
but not in HT-1080 human fibrosarcoma cell line. MTSP6 also is expressed in 
10 mammary gland carcinoma cDNA (Clontech). MTSP6 also is over expressed in 
ovarian tumor cells. 

IWTSP7 

The MTSP7 transcript was detected in lung carcinoma (A549 cell line), 
leukemia (K-562 cell line) and cervical carcinoma (HeLaS3 cell line). MTSP7 is 
15 believed to be expressed in lung, colon, prostate, breast, cervical and other 

* 

tumors. 

IVITSP9 

MTSP9 is, for example, expressed in esophageal tumor tissues, in lung 
carcinoma, in colorectal carcinoma, lymphoma, a cervical carcinoma (HeLaSS) 

20 and leukemia cell lines as well as in certain normal cells and tissues. MTSP9 
also can be a marker for breast, prostate, cervical and colon cancer. 

MTSP9 is highly expressed in the esophagus and expressed at a low level 
in many other tissues. The MTSP9 transcript is found in kidney (adult and 
fetal), spleen (adult and fetal), placenta, liver (adult and fetal), thymus, 

25 peripheral blood leukocyte, lung (adult and fetal), pancreas, lymph node, bone 
marrow, trachea, uterus, prostate, testes, ovary and the gland organs 
(mammary, adrenal, thyroid, pituitary and salivary). MTSP9 also is expressed in 
esophagus tumor tissues, in a lung carcinoma and, at a lower level, in a 
colorectal carcinoma, lymphoma, a cervical carcinoma (HeLaS3) and leukemia 

30 cell lines. 
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MTSP10 

MTSP10, for example, is expressed in esophageal tumor tissues, in lung 
carcinoma, prostate cancers, pancreatic and breast cancers and in cell lines as 
well as in certain normal cells and tissues {see e.g., EXAMPLES for tissue- 
5 specific expression profile). The level of activated MTSP10 can be diagnostic of 
prostate, uterine, lung esophagus, or colon cancer or leukemia or other cancer. 
The expression and/or activation of MTSP10 on or in the vicinity of a cell or in a 
bodily fluid in a subject can be a marker for breast, prostate, lung, colon, 
esophageal and other cancers. 

10 MTSP10 transcript was detected in pancreas, lung and kidney. MTSP10 

transcript was also detected in small intestine Marathon-Ready cDNA (Clontech). 
The MTSP10 transcript was detected in breast carcinoma (GI-101 ), lung 
carcinoma (LX-1 and GI-117), ovarian carcinoma (Gi-102), and pancreatic 
adenocarcinoma (GI-103). The MTSPIO transcript was weakly detected in 

15 prostatic adenocarcinoma (PC3). The MTSP10 transcript was also detected in 
CWR22R prostate tumor grown in nude mice. No apparent signal was detected 
in two forms of colon adenocarcinomas (GI-1 1 2 and CX-1 ). 

MTSP12 

MTSP1 2 transcript was detected in pancreas, lung and kidney. MTSP1 2 
20 transcript was also detected in small intestine Marathon-Ready cDNA (Clontech). 
The MTSP12 transcript was detected in breast carcinoma (GI-101), lung 
carcinoma (LX-1 and GI-117), ovarian carcinoma (GI-102), and pancreatic 
adenocarcinoma (GI-103). The MTSP12 transcript was weakly detected in 
prostatic adenocarcinoma (PCS). The MTSP12 transcript was also detected in 
25 CWR22R prostate tumor grown on nude mice. No apparent signal was detected 
in two forms of colon adenocarcinomas (GI-1 1 2 and CX-1 ). 

WITSP20 

MTSP20 Is expressed in the lung, colon, cervical tumors and in leukemic 
cells. It may also be expressed In breast, ovarian, pancreatic, prostate and in 
30 other tumors. MTSP20 transcript was detected in liver, lymph node, 

cerebellum, pancreas, prostate, uterus, testis, glands (adrenal, thyroid and 
salivary), thymus, kidney and spleen. Lower transcript level was found in lung. 
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placenta, bladder, ovary, digestive system, circulatory system and other parts of 
the the brain. MTSP20 is also expressed in certain tumor cell lines including 
lung carcinoma (A519), colorectal carcinoma (SW480), lymphoma (Raji and 
Daudi). cervical carcinoma (HeLaSS) and leukemia (HL-60, K-562 and MOLT-4) 
5 cell lines, 

WITSP22 

MTSP22 is expressed in the uterine tissue, thymus, adipose tissue, and 
lymph node. It may also be expressed in lung, stomach, uterine, breast, 
ovarian, prostate and in other tumors. MTSP22 transcript was detected in some 

10 uterus tissue samples, but not in their matched tumor samples. In one of 42 
uterus samples, MTSP22 is expressed in tumor and its metastatic tissues, but 
not in the normal tissue counterpart. MTSP22 is also expressed in some 
stomach tumors and lung tumors, but not in their normal tissue counterparts. 
MTSP22 is also expressed in the normal tissue of a pancreas matched cDNA 

15 pair. MTSP22-encoding cDNA was detected in thymus, adipose tissue, and 
lymph node 

WITSP25 

MTSP25 is expressed m breast, colon, uterine, ovarian, kidney, prostate, 
testicular cancer tissue. It may also be expressed in lung, stomach, prostate 

20 and in other tumors. MTSP25 transcript was expressed weakly in the lymph 
node. In the cancer profiling array analysis, MTSP25 is highly expressed in 
prostate samples {in normal and cancer samples). IVITSP25 was highly 
expressed in a kidney tumor sample, but not in its normal tissue counterpart. 
MTSP25 was also expressed a breast cancer samples, but not in its normal 

25 tissue counterpart. I\/ITSP25 was expressed in normal uterus samples, but not 
in their tumor counterparts. MTSP25 expression was also ovarian cancer 
samples. Among these three samples, the expression of MTSP25 was also 
detected in one of the matched normal tissue counterparts. MTSP25 expression 
was also detected in tumor samples in colon cDNA pairs. 

30 PCR analysis revealed that MTSP25 cDNA was strongly detected in testis 

and mammary gland adenocarcinoma, weakly detected in brain, placenta, lung. 
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Spleen, prostate, small Intestine, colon, and leukocyte, and very weakly detected 
in heart, liver and pancreas. 
2. Endotheiiases 

Endotheliases are a class of cell surface proteases that are expressed on 
5 cells, particularly endothelial cells, particularly those proliferating endothelial 
cells, which are involved in a variety of proliferative processes, including 
undesirable angiogenests associated with tumor growth and metastasis, and 
with other hyperproliferative disorders, such as restenosis, scarring, diabetic 
retinopathies, diseases and disorders of the anterior eye (see, U.S. application 

10 Serial No. 09/717,473, filed November 20, 2000, and International PCT 
application No. PCT/USOO/31 803). 

Proliferative diseases 
Endotheliases are particularly useful targets for delivery of therapeutic 
agents for treatment of any disorder involving aberrant angiogenesis. 

15 Endothelial cells play a key role in angiogenesis, which Is 

is the generation of new blood vessels from parent microvessels. Angiogenesis 
plays a major role in the metastasis of cancer and in the pathology of a variety 
of other disorders. 

Controlled and uncontrolled angiogenesis proceed in a similar manner. 

20 Endothelial cells and pericytes, surrounded by a basement membrane, form 

capillary blood vessels. Angiogenesis begins with the erosion of the basement 
membrane by enzymes released by endothelial cells and leukocytes. The 
endothelial cells, which line the lumen of blood vessels, then protrude through 
the basement membrane. Angiogenic stimulants induce the endothelial cells to 

25 migrate through the eroded basement membrane. The migrating cells form a 

"sprout" off the parent blood vessel, where the endothelial cells undergo mitosis 
and proliferate. The endothelial sprouts merge with each other to form capillary 
loops, creating the new blood vessel. 
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Angiogenesis, modulators and associated diseases 
Angiogenesis is highly regulated by a system of angiogenic stimulators 
and inhibitors. Known examples of angiogenesis stimulators include certain 
growth factors, cytokines, proteins, peptides, carbohydrates and lipids (Norrby 
5 (1997) APM/S 705:417-437); Polverini (1995) C/it. Rev. Oral. Biol. Med. 6:230- 
247). A variety of endogenous and exogenous angiogenesis inhibitors are 
known in the art (Jackson eta/. (1997) FASEB 7 7:457-465; Norrby (1997) 
APMIS 705:417-437); and O'Reilly (1997) Investigational New Drugs, 15:5-13). 
Angiogenesis is essential for norma! placental, embryonic, fetal and post- 
10 natal development and growth, but almost never occurs physiologically in 
adulthood except in very specific restricted situations. For example, 
angiogenesis is normally observed in wound healinig, fetal and embryonal 
development and formation of the corpus luteum, endometrium and placenta. 
Angiogenesis in the adult is often associated with disease states. 

1 B Persistent, unregulated angiogenesis occurs in a multiplicity of disease 

states, tumor metastasis and abnormal growth by endothelial cells and supports 
the pathological damage seen in these conditions. The diverse pathological 
disease states in which unregulated angiogenesis is present have been grouped 
together as angiogenic dependent or angiogenic associated diseases. 

20 The control of angiogenesis is altered In certain disease states and, in 

many cases, the pathological damage associated with the disease is related to 
uncontrolled angiogenesis {see generally, Norrby (1997) APMIS 705:417-437); 
and O'Reilly (1997) Investigational New Drugs 75:5-13). Thus, angiogenesis is 
involved in the manifestation or progress of various diseases, for example, 

25 various inflammatory diseases, such as rheumatoid arthritis, psoriasis, diabetic 
retinopathies, certain ocular disorders, including recurrence of pterygii, scarring 
excimer laser surgery and glaucoma filtering surgery, various disorders of the 
anterior eye, cardiovascular disorders, chronic inflammatory diseases, wound 
repair, circulatory disorders, crest syndromes, dermatological disorders (see, 

30 e.g., U.S. Patent Nos. 5,593,990, 5,629,327 and 5,712,291) and notably 
cancer, including solid neoplasms and vascular tumors. Angiogenesis is 
essential for the growth and persistence of solid tumors and their metastases. 



wo 02/095007 PCT/US02/16819 



-73- 

Repressing, eliminating or modulating this activity, should Impact the etiology of 
these diseases and serve as a point of therapeutic intervention. In the disease 
state, prevention of angiogenesis could avert the damage caused by the invasion 
of the new microvascular system. Therapies directed at control of the 
5 angiogenic processes could lead to the abrogation or mitigation of these 
diseases. Hence there is a need to develop therapeutics that target 
angiogenesis and modulate, particularly, inhibit aberrant or uncontrolled 
angiogenesis. 

Hence conjugates that contain endotheliase substrates can be used to 
10 deliver therapeutic agents for the treatment of diseases including, but are not 
limited to, rheumatoid arthritis, psoriasis, diabetic retinopathies, other ocular 
disorders. Including recurrence of pterygil, scarring from excimer laser surgery 
and glaucoma filtering surgery, various disorders of the anterior eye, 
cardiovascular disorders, autoimmune diseases, chronic inflammatory diseases, 
15 wounds, circulatory disorders, crest syndromes, restenosis, psoriasis and other 
dermatological disorders (see, e.g., U.S. Patent Nos. 5,593,990, 5,629,327 
and 5,712,291) and notably cancer. Including solid neoplasms and vascular 
tumors. 

Endotheliases 1 and 2 

20 Exemplary of endotheliases are two different endotheliases and variant 

forms thereof designated endotheliase 1 and endotheliase 2 (see SEQ ID Nos. 
21-27. Other members of the family can be identified by probing for genes or 
searching libraries for genes that have sequence identity, particularly at least 
40%, 60%, 80%, 90%, 95%, 98% or greater sequence identity to the protease 

25 domain of an endotheliase Identified herein, or that hybridize under conditions of 
high stringency to the full-length of the nucleic acid encoding a protease domain 
of an endotheliase provided herein, and that are expressed on endothelial cells. 

Alternatively, and as a way of identifying endotheliases that can have 
lower sequence identity, an endotheliase can be identified by the methods, such 

30 by identifying ESTs or other nucleic acid fragments that have sequences similar 
to a protease and then using such fragments as probes to identify and select 
cDNA clones encoding full-length proteases or protease domains thereof, 
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identifying those that have the characteristics of transmembrane proteins, and 
then determining the gene expression profile to identify those that are expressed 
on the surface of endothelial cells. Encoded proteins that have protease activity, 
that include a transmembrane domain and an extracellular domain, and that are 
5 expressed in endothelial cells are endotheliases. Any method for identification 
of genes encoding proteins (or proteins) that encode a transmembrane protease 
expressed on an endothelial cell is contemplated herein. 
Endotheliase 1 

Exemplary of the endotheliase are endotheliase 1 and endotheliase 2. 
10 These are expressed on endothelial cells. Exemplary of a full-length endotheliase 
1 is one that includes the sequence of amino acids set forth in SEQ ID No. 42 
(see, International PCT application No. WO 00/5006, which describes a gene it 
designates DESC1 that is expressed in squamous cell carcinomas and prostate 
tumors). As noted endotheliases are expressed on endothelial cells. A protease 
15 domain thereof is set forth in SEQ ID NO: 22. 

Expression profile of endotheliase 1 
To obtain information regarding the tissue distribution of endotheliase 1 , 
the DNA Insert of clone H1 17 was used to probe an RNA blot composed of 76 
different human tissues (catalog number 7775-1; human multiple tissue 
20 expression (MTE) array; CLONTECH, Palo Alto, CA). Significant expression was 
observed in the esophagus, with minor expression levels in the stomach, salivary 
gland, pancreas, prostate, bladder, trachea and uterus. Northern analysis using 
RNA blots (catalog numbers 7765-1 & 7782-1 ; human muscle and digestive 
system multiple tissue northern (MTN) blots; CLONTECH) confirmed that the 
25 expression was restricted to the esophagus. Two transcripts (approximately 1 .7 
and 2 kb) were detected in the esophagus. Endotheliase 1 also is expressed in 
umbilical vein endothelial cells, PC3 and LnCAP cells. 

Endotheliase 2 and nucleic acids encoding endotheliase 2 
Two splice variant forms of endotheliase 2 designated endotheliase 2-S 
30 and endotheliase 2-L are exemplified herein (see SEQ ID Nos. 23-26). The open 
reading frame of the nucleic acid encoding endotheliase 2-S (SEQ ID No. 23} is 
composed of 1 ,689 bp, which translates to a 562-amino acid protein (SEQ ID 
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No. 24), while the ORF of endotheliase 2-L is composed of 2,067 bp (SEQ ID 

No. 25}, which translates to a 688-amino acid protein (SEQ ID No. 26). 

The nucleic acid encoding the protease domain of endotheliase 2-S is 

composed of 729 bp which translates to a 242-amino acid protein {amino acids 

5 321-562 of SEQ ID Nos. 23 and 24), while that of endotheliase 2-L is composed 

of 1,107 bp, which translates to a 368-amino acid protein (amino acids 321-688 

of SEQ ID Nos. 25 and 26). 

Endotheliase-2 proteins 

Any and all of the above-noted endotheiiases and/or protease domains 

10 thereof, such as those that include the sequences of amino acids in SEQ ID Nos. 

22, 24, 26 and 27 or are encoded by nucleic acid that hybridize thereto under 

the conditions as described above are contemplated for use in the methods 

herein. Also contemplated herein are proteins that include amino acid sequence 

changes, such as those set forth in Table 1 above, and retain protease activity. 

15 Gene expression profile and transcript size of endotheliase 

2 in normal and tumor tissues 

In addition to expression in endothelial cells, endotheliase 2 is expressed 

in placenta, pancreas, thyroid gland, liver and lung tissues. It also is expressed 

at lower levels in mammary gland, salivary gland, kidney, trachea, esophagus, 

20 appendix, heart and fetal tung. Endotheliase 2 also is expressed in several tumor 
cell linos and, hence, in certain tumors, including lung and colon. Including 
breast carcinoma, lung carcinomas, colon adenocarcinomas, pancreatic 
adenocarcinoma (Gl-103), and ovarian carcinoma. It has also been detected in 
prostate and fibrosarcoma cell lines. 

25 C. Conjugates 

Conjugates that are substrates for proteases on the surfaces of cells, 
particularly serine proteases, including type II membrane-bound serine proteases, 
and endotheiiases are provided. Any cell surface protease, including cell- 
associated or localized proteases, is contemplated herein. Generally proteases 

30 expressed at high levels in active forms in essential tissues are not ideal target 
candidates. The proteases include those that are expressed on relatively lim'rted 
numbers of cells or that are expressed at high levels in cells, such as tumor cells 
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and endothelial celts and immune cells, that are involved in disease states or are 
present in diseases states in the locale of cells involved in the disease states. 
For example, endothelial cells by virtue of their role in angiogenesis are involved 
in numerous proliferative disorders; immune cells are involved in many disease 
5 processes including cancers and diseases and inflammatory disorders. Other cell 
surface proteases are expressed at higher levels in certain tumors than in normal 
cells. Whether or not such proteases have a role in the disorder their higher 
expression in cells involved in a disease state is sufficient for use for targeting 
therapeutic agents in the conjugates provided herein. 

10 The conjugates, which contain a therapeutic agent, such as a cytotoxic 

agent, is activated upon cleavage by a cell surface protease, including cell- 
associated and cell-localized proteses. Exemplary of such proteases are the 
MTSPs, such as, but not limited to, MTSP1, MTSP3, MTsP4, MTSP6, MTSP7, 
MTSP9, MTSP10, MTSP12, IVITSP20, MTSP22, MTSP25, urokinases and 

15 endotheliases. Hence, the conjugates targeted to such proteases are prodrugs in 
that the therapeutic agent is inactive as administered and is ultimately activated 
in the vicinity of the targeted cell or tissue. Although cell surface proteases, 
such as transmembrane proteases, are the intended targets, any released, shed 
or soluble forms of the proteases and others also can be targeted. 

20 Thus, the conjugates, which contain a therapeutic agent, such as a 

cytotoxic agent, are substantially inactive prior to action by a cell surface 
protease, a peptidic moiety that is a substrate for a targeted cell surface 
protease (/.e., a peptidic substrate), and, optionally, a linker. The therapeutic 
agents in the conjugates are activated upon cleavage of the peptidic substrate of 

25 the conjugate by a cell surface protease. The therapeutic agents, such as 

cytotoxic agents, are released as the free yagent, or, alternatively, are released 
coupled to the portion of the peptidic substrate (PI -P2-P3-etc. {i.e., the N- 
terminus) or P1'-P2'-etc- (/.e., the C-terminus) that the agents were linked to in 
the conjugate, optionally via a linker. The cytotoxic agents, in these forms, are 

30 released in the vicinity of cells that express the proteases. Activation is 
effected, in certain embodiments, because the therapeutic agent, such as 
cytotoxic agent, following action of the cell surface protease, can cross the cell 
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membrane or otherwise interact with the cell or tissue and exhibit therapeutic 
activity. In other embodiments, any remaining peptidic moieties or amino acids 
can be cleaved from therapeutic agent to render It active. The conjugates act as 
prodrugs because the therapeutic agents when conjugated are substantially 
5 inactive. Upon cleavage by the targeted protease, the therapeutic agent is 
released either in active form or in a form that is activated by the targeted cell, 
tissue or surrounding environment. 

In one exemplary embodiment, the targeted agent is a cytotoxic agent 
and the conjugates for use in the methods and compositions provided herein 

10 have the formula: 

(peptide'),-f 'n'<^6'')q-{cytotoxic agent)^ 
or a derivative thereof, where peptide' is a peptidic substrate for a cell surface 
protease or a released, shed or otherwise unbound membrane protease, such as 
an MTSP; s is greater than or equal to 1 , or is 1 to 6, or is 1 or 2, or is 1 ; linker 

15 is any linker; q is greater than or equal to 0, or is 0 to 4, or is 0 or 1; the 

cytotoxic agent is an anti-tumor, anti-cancer or anti mitotic agent, including anti- 
antiangiogenic agents; and t is 1 or more, or is 1 or 2. In these conjugates, the 
cytotoxic agent is covalently attached, optionally via a linker, to either the C- 
terminus or the N-terminus of the peptidic substrate. In embodiments where the 

20 therapeutic agent, such as a cytotoxic agent, is attached to the C-terminus of 
the peptidic substrate, the N-terminus optionally is capped. N-Terminal caps for 
use herein include, but are not limited to, acy), sulfonyl and carbamoyl groups. 
In embodiments where the therapeutic agent is attached to the N-terminus of 
the peptidic substrate, the C-terminus is a carboxamide derivative. 

25 In certain embodiments, peptide' is a peptidic substrate for a cell surface 

protease or a soluble MTSP whereby, upon action of the protease, the 
conjugate, which is substantially inactive, is cleaved at the PI -PI ' bond to 
release a compound of the formula: 

(peptide')3-{linker)qHtherapeutic agent)^ 

30 or a derivative thereof, that exhibits therapeutic activity, such as cytotoxic 

activity in vitro and in vivo. In these compounds, peptide" is a truncated version 
of peptide' resulting from cleavage at the Pl-PT bond. 
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In another embodiment, the conjugates for use in the methods and 
compositions provided herein possess two therapeutic agents, such as cytotoxic 
agents, which are the same or different, linked to the C-terminus and the N- 
terminus, respectively, optionally via linkers linker^ and linker^, of a peptidic 
5 substrate for cell surface protease or a soluble MTSP. In this embodiment, the 
conjugates have the formula: 

(therapeutic agent^),-(linker^)^-(peptlde'),-(linker^)q-(therapeutlc agent^), 

or a derivative thereof, where peptide' is a peptidic substrate for a cell surface 

protease, or a soluble MTSP; s is greater than or equal to 1 , or is 1 to 6, or is 1 

10 or 2, or is 1 ; linker^ and linker^ are each independently any linker and are the 

same or different; q and w are each independently greater than or equal to 0, or 
are 0 to 4, or are 0 or 1; the therapeutic agents, which are the same or 
different, are anti-tumor, anti-cancer or anti mitotic agents; and t and x are each 
independently 1 or more, or are 1 or 2. 

15 In these embodiments, peptide' is a peptidic substrate for a cell surface 

protease or a soluble MTSP whereby, upon action of the protease, the 
conjugate, which Is substantially inactive, is cleaved at a point on the peptidic 
chain to release two compounds of the formulae: 
(therapeutic agent^)^-(linkerMw-{peptide'"),; and 

20 {peptide*^)s-((inker^)q- (therapeutic agent^)^ 

or derivatives thereof. The released therapeutic agents are active or are further 
activated by the cell, tissue or surrounding environment. In these compounds, 
peptide"^ and peptide"^ are N-terminal and C-terminal truncated portions, 
respectively, of peptide* resulting from cleavage at the PI-PT bond. 

25 In one embodiment, the conjugates for use in the compositions and 

methods provided herein have formula I: 
X"-{P6)^-(P5)p-(P4),-(P3)j-{P2),-P1-(Pr),-(P2')k-(P3'),-iL)„-Z 

or a derivative thereof, where Z is a therapeutic agent; L is a linker; I, j, i, p and 
m are selected as follows: 
30 I is 0 or 1 ; when I is 0, j, i, p and m are 0; when I is 1 , j is 0 or 1 ; when j 

is 0, i, p and m are 0; when j is 1 , i is 0 or 1 ; when i is 0, p and m are 0; when i 
is 1 , p is O or 1 ; when p is 0, m is 0; when p is 1 , m is 0 or 1 ; 
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u, k and r are selected as follows: 

u is 0 or 1 ; when u is 0, k and r are 0; when u is 1 , k is 0 or 1 ; when k is 
O, r is 0; when k is 1 , r Is 0 or 1 ; 

n is 0 or 1 ; X" is hydrogen, or an acyl, sulfonyl or carbamoyl cap; and P6 
5 to P3' are amino acid residues, as defined below. In this embodiment, the P6 to 
P3' residues are linked by peptide bonds or peptide bond surrogates. Thus, the 
P6 to P3' portion of the conjugate is a peptidic substrate, as defined herein. 

In another embodiment, the conjugates for use in the compositions and 
methods provided herein have formula II: 
10 Z-(L)„-(P6)^-(P5)p-(P4);-(P3)j-(P2),-P1-(PV)^-(P2')k-(P3')rX'= 

or a derivative thereof, where Z is a therapeutic agent; L is a linker; t, j, i, p and 
m are selected as follows: 

I is 0 or 1 ; when 1 is 0, j, i, p and m are 0; when I is 1 , j is 0 or 1 ; when j 
Is 0, i, p and m are 0; when j is 1 , i is 0 or 1 ; when i is 0, p and m are 0; when i 
15 is 1 , p is 0 or 1 ; when p is 0, m is 0; when p is 1 , m is 0 or 1 ; 

u, k and r are selected as follows: 

u is 0 or 1 ; when u is O, k and r are 0; when u is 1 , k Is O or 1; when k is 
0, r is 0; when k is 1 , r is 0 or 1 ; 

n is 0 or 1 ; X'=, together with the carbonyl group of the amino acid 
20 residue to which it is attached, forms a carboxylic acid or a carboxamide group; 
and P6 to P3' are amino acid residues, as defined below. Jn this embodiment, 
the P6 to P3' residues are linked by peptide bonds or peptide bond surrogates. 
Thus, the P6 to P3' portion of the conjugate is a peptidic substrate, as defined 
herein. 

25 In a further embodiment, the conjugates for use in the compositions and 

methods provided herein have formula III: 
Z^-(L^)„-(P6)„,-(P5)p-(P4)r(P3)j-(P2),-P1-{Pr)„-(P2')k-{P3'),-(L2)„-Z2 
or a derivative thereof, where Z^ and 7} are each therapeutic agents and are the 
same or different; and are each linkers and are the same or different; I, j, i, 

30 p and m are selected as follows: 
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I is 0 or 1 ; when I is 0, j, i, p and m are 0; when I is 1 , j is 0 or 1 ; when j 
is 0, i, p and m.are 0; when j is 1 , t is 0 or 1 ; when i is 0, p and m are 0; when i 
is 1 , p is 0 or 1 ; when p is 0, m is 0; when p is 1 , m is 0 or 1 ; 

u, k and r are selected as follows: 
5 u is 0 or 1 ; when u is 0, k and r are 0; when u is 1 , k is 0 or 1 ; when k is 

0, r is 0; when k is 1 , r is 0 or 1 ; 

n and v are each independently 0 or 1 ; and P6 to P3' are amino acid 
residues, as defined below. In this embodiment, the P6 to P3' residues are 
linked by peptide bonds or peptide bond surrogates. Thus, the P6 to P3' portion 
10 of the conjugate is a peptidic substrate, as defined herein. 

In another embodiment, the conjugates for use in the compositions and 
methods provided herein have formula IV: 
X'*-(P6)^-(P5)p-{P4)r(P3)j-(P2),-P1 -(PI ')„-(P2'),-(P3'),-(P4')r(L)„-Z 
or a derivative thereof, where Z is a therapeutic agent; L Is a linker; I, j, i, p and 
15 m are selected as follows: 

I is 0 or 1 ; when I is 0, j, i, p and m are 0; when I is 1 , j is 0 or 1 ; when j 
is 0, i, p and m are 0; when j is 1 , i is 0 or 1 ; when i is 0, p and m are 0; when i 
is 1 , p is 0 or 1 ; when p is 0, m is 0; when p is 1 , m Is 0 or 1 ; 

u, k, r and s are selected as follows: 
20 u is 0 or 1 ; when u Is 0, k, r and s are 0; when u Is 1 , k is O or 1 ; when k 

is 0, r and s are 0; when k is 1 , r is 0 or 1 ; when r is 0, s is 0; when r is 1 , s is 
O or 1 ; 

n is 0 or 1 ; X" is hydrogen, or an acyl, sulfonyl or carbamoyl cap; and P6 
to P4' are amino acid residues, as defined below. In this embodiment, the P6 to 
25 P4' residues are linked by peptide bonds or peptide bond surrogates. Thus, the 
P6 to P4' portion of the conjugate is a peptidic substrate, as defined herein. In 
another embodiment, the conjugates for use in the compositions and methods 
provided herein have formula V: 

Z-(L)„-(P6)^-(P5)p-{P4V(P3),-<P2)rP1-{P1')„-(P2'),-(P3')-(P4'),-X'= 
30 or a derivative thereof, where Z is a therapeutic agent; L Is a linker; I, j, i, p and 
m are selected as follows: 
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I is 0 or 1 ; when I is 0, ], i, p and m are O; when I is 1 , j is 0 or 1 ; when j 
is 0, i, p and m are 0; when j is 1 , i is 0 or 1 ; when i is 0, p and m are 0; when i 
is 1 , p is 0 or 1 ; when p is 0, m is 0; when p is 1 , m is 0 or 1 ; 

u, k, r and s are selected as follows: 
5 u is 0 or 1 ; when u is 0, k, r and s are 0; when u is 1 , k is 0 or 1 ; when k 

is 0, r and s are 0; when k is 1 , r is 0 or 1; when r is 0, s is 0; when r is 1 , s is 
0 or 1; 

n is O or 1 ; X'^, together with the carbonyl group of the amino acid 
residue to which it is attached, forms a carboxylic acid or a carboxamide group; 
10 and P6 to P4' are amino acid residues, as defined below. In this embodiment, 
the P6 to P4' residues are linked by peptide bonds or peptide bond surrogates. 
Thus, the P6 to P4' portion of the conjugate is a peptidic substrate, as defined 
herein. 

tn a further embodiment, the conjugates for use in the compositions and 

15 methods provided herein have formula Vi: 

2^-(U)n-(P6)^-(P5)p-(P4)HP3)j-(P2},-P1-(P1')„-(P2')k-(P3')-(P4'),-(L2),-22 
or a derivative thereof, where and 2^ are each therapeutic agents and are the 
same or different; and are each linkers and are the same or different; i, j, i, 
p and m are selected as follows: 

20 I is 0 or 1 ; when I is 0, j, i, p and m are 0; when I is 1 , j is 0 or 1 ; when j 

is 0, i, p and m are 0; when j is 1 , i is 0 or 1 ; when i is 0, p and m are 0; when i 
is 1 , p is 0 or 1 ; when p is 0, m is 0; when p is 1 , m is O or 1 ; 
u, k, r and s are selected as follows: 

u is 0 or 1 ; when u is 0, k, r and s are 0; when u is 1 , k is 0 or 1 ; when k 
25 is 0, r and s are 0; when k is 1 , r is 0 or 1 ; when r is 0, s is 0; when r is 1 , s is 
0 or 1; 

n and v are each independently 0 or 1 ; and P6 to P4' are amino acid 
residues, as defined below. In this embodiment, the P6 to P4' residues are 
linked by peptide bonds or peptide bond surrogates. Thus, the P6 to P4' portion 
30 of the conjugate is a peptidic substrate, as defined herein. 

Exemplary peptidic substrates, therapeutic agents, linkers and exemplary 
conjugates of formulae l-VI are described in further detail below. It is intended 
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herein that conjugates resulting from all combinations and/or permutations of the 
groups recited below for the variables of formulae i-V( are encompassed within 
the instant disclosure. 

1 . Peptidic Substrates 
5 The peptidic substrates contemplated for use in the conjugates are 

substrates for the targeted cell surface protease or a soluble, shed or released 
form thereof, and contain a sufficient number of amino acid residues to render 
any therapeutic agent in the conjugate substantially inactive. In the exemplary 
embodiment where the therapeutic agent is, for example, doxorubicin, the 
10 conjugate is substantially inactive by virtue of the inability of the conjugated 
therapeutic agent to cross the cell membrane. In certain embodiments, the 
peptidic substrate contains at least 1 , 2, 3, 4 or 5 amino acid residues, and can 
contain up to nine or ten residues. Longer peptidic substrates can be used in 
the conjugates as long as upon cleavage, the resulting therapeutic agent or 
15 therapeutic agent-amino acid or -peptidic moiety conjugate exhibits the desired 
therapeutic effect in vivo and in vitro. 

Hence, exemplary peptidic substrates for use in the conjugates provided 
herein possess at least one amino acid (PI), two amino acids (PT-PT), three 
amino acids (P2-P1-PV) and typically contain four, five or six amino acid 
20 residues (P3-P2-P1-PV, P4-P3-P2-P1 -PI' or P4-P3-P2-P1-P1 '-P2'), where the 

P1-P1' bond is the site of cleavage of cell surface protease, or a soluble, shed or 
released form thereof, including, but not limited to, a cell surface protease, such 
as a serine protease, including, for example, but not limited to, uPA bound to its 
receptor, MTSPs and endotheliases. The peptidic substrates optionally further 
25 possess a P5, P6 or P3' amino acid residue, and, in certain embodiments, 

possess P7, P8, P9, P10, P4', P5', P6' residues. Thus, the peptidic substrates 
for use in the conjugates provided herein are penta-, hexa-, hepta-, octa- and 
nona-peptidic substrates, and can contain 10, 11, 12, 13, 14, 15 or more 
residues as long as, upon cleavage of the conjugate by the protease, the 
30 resulting therapeutic agent or therapeutic agent-amino acid or -peptidic moiety 
conjugate exhibits the desired therapeutic effect in vivo and in vitro. 
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The peptidic substrates are conjugated to the therapeutic agent (or to a 
linker to which the therapeutic agent Is linked) via the C-terminal residue {i.e., 
PV, P2' or P3'), or the N-terminat residue {i.e., P6, P5 or P4), or optionally an 
internal residue. The peptidic substrates, for example, can be straight chains, 
5 but can be cyctized or include cyclized portions. 

In embodiments where the conjugation is via the C-termlnus of the 
peptidic substrate, the peptidic substrate optionally possesses a cap, such as an 
acyl or carbamoyl cap at the N-terminus. In embodiments where conjugation is 
via the N-terminus of the peptidic substrate, the peptidic substrate further 
10 possess a terminal group, such as a carboxamide group, at the C-terminus. 

The conjugates can contain a plurality of peptidic substrates and a 
plurality of therapeutic agents. For example, in conjugates that contain two 
therapeutic agents, which are the same or different, conjugation to the 
therapeutic agent(s) or linker linked thereto can be via the C-termlnal and N- 
15 terminal residues of the peptidic substrate. 

The methods described for selection of substrates above can be used to 
design suitable substrates. In addition, substrates can be designed based upon 
known specificities of other proteases. For example, the specificities of trypsin- 
llke and trypsin family members can aid in design of possible substrates. The 
20 following summarizes substrate preferences for particular serine proteases (see, 
e.g., Han-is eta/. (2000) PNAS 97(74):77B4'77bB). 



PROTEASE 


EXEMPLARY PI 
RESIDUE(S) 


EXEMPLARY P2 
RESIDUE(S) 


EXEMPLARY P3 
RESIDUE(S) 


Chymotrypsin 


Tyr, Phe, Trp 






Trypsin 


Arg, Lys 






Thrombin 


Arg, Lys 


Phe 


Thr, Trp 


Plasmin 


Lys, Arg 


Trp, Tyr, Met 


Gin 


Granzyme B 


Asp 






Human Neutrophil 
Elastase 


Ala, Val, lie 






Tissue Plasminogen 
Factor 


Arg 


Ser, Gly, Ala 


Met, Tyr 


Urokinase 


Arg 


Ser, Ala 


Thr, Ser 


Factor Xa 


Arg 


Gly 
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Typical protocols for preparation of the conjugates can include the steps 
of: 1) identification of a targeted protease; 2) expression and assay 
development; 3) substrate selection, such as, for example, by testing 
chromogenic or fluorogenic substrates to identify those cleaved by a selected 
5 target protease, by use of substrate phage display to identify peptidic substrates 
cleaved by a targeted protease, by use of a natural protein or peptide substrate 
or a natural inhibitor of the protease, and by use of combinatorial libraries to 
identify substrates cleaved by a targeted protease; 4) synthesis of conjugates 
containing the identified substrate; and 5) biological evaluation thereof, 

10 including, but not limited to, in vitro assays, cell culture assays, biological 
assays, and in vivo animal models (see, e.g., EXAMPLE 10). 

A conjugate can be designed by any methods known to those of skill in 
the art. The following provides an exemplary protocol. First, a series of 
commercially available chromogenic and fluorogenic peptidic substrates can be 

1 5 tested for cleavage by the protease of interest (see Examples for lists of 

exemplary chromogenic and fluorogenic substrates and the table below). The 
peptidic portion of these substrates occupies the unprimed binding sites of the 
protease while the reporter group is located on the primed side of the scissle 
bond. Effective conjugates can then be designed based on the structure of the 

20 substrates that are efficiently cleaved by the protease. 

The peptidic portion of these efficiently cleaved substrates can be used 
as the unprimed region of the conjugate, and Ser-therapeutic agent, such as a 
cytotoxic agent (e.g., doxorubicin), Ser- Leu-therapeutic agent or Ser-Ser-Leu- 
therapeutic agent can be used as the primed region of the conjugate. Cleavage 

25 of these conjugate prodrugs releases either Ser-therapeutic agent, Ser-Leu- 
therapeutic agent or Ser-Ser-Leu-therapeutic agent compounds. In another 
embodiment, the Ser in the released Ser-therapeutic agent may be replaced by 
other amino acid residues including, but not limited to, Ala, hSer, Abu, Thr, Met, 
nLeu and Val. In another embodiment, such as when the therapeutic agent is 

30 doxorubicin, the amino acid residue conjugated to the therapeutic agent 

possesses a hydrophobic side chain. Such amino acid residues include, but are 
not limited to. Leu, Abu, nLeu, nVal, CHA, hCHA, (hex)Gly, (allyOGly, 
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{propargyDGly and (cyclopropyl)Ala. In another embodiment, such as when the 
therapeutic agent is taxol, the amino acid residue conjugated to the therapeutic 
agent possesses a side chain that is not sterically bulky. Such amino acid 
residues include, but are not limited to, Gly and Ala. The resulting PV- 
5 therapeutic agent, PI '-P2 '-therapeutic agent, or PI '-P2'-P3'-therapeutic agent 
compound can be further processed in vivo into active therapeutic agents. 

Another approach to designing a conjugate prodrug for a protease 
substrate is to use substrate phage display to elucidate optimal subsite 
occupancy for the protease. The resulting information can then be used to 
10 design the pepttdic, unprimed portion of the conjugate. As described above, the 
primed region of the conjugate can be fixed as Ser-therapeutic agent, Ser-Leu- 
therapeutic agent or Ser-Ser-Leu-therapeutic agent. 

A third approach to design an effective prodrug conjugate Involves the 
use of combinatorial fluorogenic substrate libraries to determine optimal residues 
1 5 for the unprimed region of a protease substrate. These selected sequences can 
then be used as the unprimed portion of the conjugate prodrug and, and Ser- 
therapeutic agent, {e.g., doxorubicin), Ser-Leu-therapeutic agent or Ser-Ser-Leu- 
therapeutic agent can be used as the primed region of the conjugate. 
These methods have been used in the design of the peptidic substrate portion of 
20 the conjugates provided herein. For example, sequences including GSGR (and 
related sequences such as TGR, SGR, extended variants and others herein) were 
based on or dervied from substrate phage display experiments using u-PA as the 
taret protease. Many matriptase conjugates, such as (R/K)-X-S-R and X-(R/K)-S- 
R, and related sequences as provided herein, were based on data from 
25 combinatorial libraries. In other embodiemnts, seqeuence sequences in natural 
substrates or natural inhibitors of a protease target, such as uPA, including 
VSAR, PGR (from P3-P1 of plasminogen) and related sequences, were used in 
design of u-PA-targetd conjugates. In other embodiments, sequences from 
chromgenic substrates, such as D-HHT-Gly-Arg, and related sequences, were 
30 used for design of ET-1 -targeted conjugates. 

Chromogenic/fluofogenic substrates 



Enzyme Substrate Structure 
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MTSP1 


spectrozyme t-PA 


CHsSOz-D-HHT-Gly-Arg-pNA.AcOH 


MTSPl 


S 2765 


N-or-Z-D-Arg-Gly-Arg-pNA.2HCI 


MTSP3 


Spectrozyme fXWa 


H-D-CHT-Gly-Arg-pNA.2AcOH 


MTSP4' 


Spec PL 


H-D-Nle-HHT-Lys-pNA.2AcOH 


MTSP5 


S 2765 


N-CT.Z-D-Arg-Gly-Arg-pNA.2HCI 


IVITSP6 


spectrozyme t-PA 


CHgSOz-D-HHT-Gly-Arg-pNA.AcOH 


MTSP7 


S 2366 


pyroGiu-Pro-Arg-pNA.HCI 


MTSP9 


Pefachrome A/lta 


CHaSOj-D-CHA-But-Arg-pNA 


MTSPl 0 


spectrozyme t-PA 


CHaSOz-D-HHT-Gly-Arg-pNA.AcOH 


MTSP22 


S 2366 


py roGlu-Pro-Arg-p N A . H CI 


ET-1 


spectrozyme t-PA 


CHsSOa-D-HHT-Gly-Arg-pNA.AcOH 


ET-2 


S 2765 


N-a-Z-D-Arg-Gly-Arg-pNA.2HCI 


u-PA 


S-2444 


pyroGlu-Gly-Arg-pNA.HCI 



15 ^ coupled assay, activation of plasminogen in the presence of Spec 
PL 

Briefly, for a coupled assay, the ability of the protease to activate an 
enzyme, such as plasminogen or trypsinogen is tested. To perform these 
assays, a protease is incubated with a zymogen, such as plasminogen or 

20 trypsinogen, in the presence of a labelled known substrate, such as lys- 
plasminogen or Spec PL (for plasmin), for the zymogen. If protease 
activates the zymogen, the activated enzyme, such as piasmin and 
trypsin, will degrade the substrate, thereby changing the spectral 
properties of the substrate. 

25 Exemplary peptidic substrates 

The following description provides exemplary peptidic substrates 
for cleavage by proteases, such as MTSPl (or matriptase), endotheliase 1 
and urokinase, and a general discussion of properties of the residues. In 
a similar manner, peptidic substrates for cleavage by other cell surface 

30 proteases, or a soluble, shed or released form thereof, can be similarly 
designed by identifying peptidic substrates for the selected protease and 
then preparing conjugates that contain such peptidic substrates. 
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a. The PI Residue 

Amino acid residues for use at the PI position of the peptidic 

substrates for use in the conjugates provided herein include Arg, Arg 

surrogates and Lys. Arg surrogates include unnatural amino acids that 

5 possess a group or moiety that functions in substantially the same way 

as the naturally occurring side chain of arginine to achieve substantially 

the same result {e.g., acting as the PI residue in a substrate for a 

MTSP1, urokinase or endotheliase). Arg surrogates include, but are not 

limited to, a-amino acids that possess as the side chain any of the 

10 following: the side chain of homoarginine; guanidinoaminopropyl; 

guanidinoaminoethyl; (Meljarginine side chain; (Etlzarginine side chain; (4- 

aminomethyOphenylmethyl; 4-amidinophenylmethyl; 4-guanidinophenyl- 

methyl; or the Arg surrogate is a conformationally constrained arginine 

analog such as: 
15 NH 



20 




25 

where z is 0 or 1 (see, e.g., Webb eta/. (1991) J. Org. Chem. 56:3009); 
or the side chain is a conformationally constrained arginine side chain 
analog such as: 

30 



35 
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5 



10 




15 where d is an integer from 0 to 5, or 1 to 3; and W is N or CH; or a 
mono- or di-substituted N-alkyI derivative of the above groups, where 
alkyi is, in certain embodiments, iower alkyi, such as, for example, 
methyl. 

In certain embodiments herein, the PI residue is Arg. 
20 b. The P2 Residue 

In the conjugates provided herein, the P2 residue is selected from 
Phe, Ser, Gly, Ala, Ser(OMe), hSer, 1-methylHis, 3-methylHis, His, nVal, 
nLeu, Abu, (hS)Gly, Thr, Aib, CHA and Tyr. In another embodiment, the 
P2 residue is selected from Phe, Ser, Gly and Ala. In certain 
25 embodiments herein, the P2 residue is Ser or Ala. In another 
embodiment, the P2 residue is Gly or Ala. 
c. The P3 Residue 
Amino acid residues for use at the P3 position of the conjugates 
provided herein include Arg, Lys, Gin, Quat, Arg surrogates, Ser, Thr, 
30 hSer, dSer, Pro, (hSfGly, Tyr, 4,4-dimethylThr, Asn, Met(02), Quat^, 
Quat^, Quat* and Quat^. In another embodiment, the P3 residue is 
selected from Arg, Lys, Gin, Quat and Arg surrogates. Arg surrogates 
include those described above for the PI residue. 

In certain embodiments, the P3 residue is Gin or Ser. 
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d. The P4 Residue 

In the conjugates provided for use in the connpositions and 
methods provided herein, the P4 residue is selected from Pro, Arg, Ser, 
Ala, Lys, Gly, nLeu, Leu, Tyr, Glu, Phe, Val, N,N-dimethylGly, >ff-Ala, 
5 Cys(Me|, Gin, t-butylGly and nVal. In another embodiment, the P4 

residue is selected from Pro, Arg, Ser, Ala, Lys, Gly, nLeu, Leu, Tyr, Glu, 
Phe and Val. In further embodiments, the P4 residue is selected from 
Pro, Arg, Ser, Ala, Lys, Gly, nLeu, Phe or Val. In certain embodiments 
herein, the P4 residue is Arg or Gly. 

10 e. The P5 and P6 Residues 

In certain embodiments herein, the peptidic substrates used in the 
conjugates contain a P5 and, optionally, a P6 residue. P5 residues 
include lie, Arg and Arg surrogates. In another embodiment, P5 residues 
include Arg and Arg surrogates. Arg surrogates include those described 

15 above for the PI residue. P6 residues include, for example. Leu, Val and 
Arg. In another embodiment, P6 residues include, for example. Leu. 
f. The PI' Residue 
The PI ' residue of the conjugates provided herein is Gly, Ser, Ala, 
Leu, He, d-lle, nLeu, Val, nVal, Aib, Abu, Met, 6-aminohexanoyl, Thr or 

20 hSer. In another embodiment, the PV residue of the conjugates provided 
herein is Gly, Ser, Ala, Leu, He, d-lte, nLeu, Val, nVal, Aib, Abu, Met or 6- 
aminohexanoyl. In another embodiment, the PV residue is Ser, Aia, 
hSer, Abu, Thr, Met, nLeu or Val. In another embodiment, the PT 
residue is Gly or Ala. In another embodiment, the PV residue is Ser, Ala 

25 or Gly. In another embodiment, the PI ' residue is Leu, Abu, nLeu, nVal, 
CHA, hCHA, (hex)Gly, (allyDGIy, (propargyl)Gly or (cyclopropyl)Ata. In 
certain embodiments herein, the PV residue is Ala, Ser, Gly, lie or d-Ile. 
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g. The P2' Residue 

In certain embodiments herein, the conjugates provided herein 
possess a P2' residue. P2' residues for use herein include, but are not 
limited to, Gly, Ser, Ala, Leu, lie, d-lle, nLeu, Val, nVal, Aib, Abu, Met, 6- 
5 aminohexanoyi, hCHA, CHA, hexylGiy, allylGly and Phe. In another 
embodiment, P2' residues for use herein include, but are not limited to, 
Gly, Ser, Ala, Leu, lie, d-lle, nLeu, Val, nVal, Aib, Afau, Met and 6- 
aminohexanoyl. In another embodiment, the P2' residue is Ser, hSer, 
Abu, nLeu, nVal, CHA, hCHA, (allyl)Gly or (hexyDGIy. In another 
10 embodiment, the P2' residue is Gly or Ala. In another embodiment, the 
P2' residue is Leu, Abu, nLeu, nVal, CHA, hCHA, {hex)Gly, (allyl)Gly, 
{propargyl)Gly or (cyclopropyl)Ala. In further embodiments, the P2' 
residues are Ala, Gly, lie or d-lle. 

h. The P3' Residue 

15 In other embodiments herein, the peptidic substrates used in the 

conjugates provided herein Include a P3' residue. P3' residues for use 
herein include, but are not limited to, Gly, Ser, Ala, Leu, lie, nLeu, Val, 
nVal, Aib, Abu, Met, 6-aminohexanoyl, CHA and allylGly. In another 
embodiment, the P2' residue is Ser, hSer, Abu, nLeu, nVal, CHA, hCHA, 

20 (allyl)Gly or {hexyDGIy. In another embodiment, P3' residues for use 
herein include, but are not limited to, Gly, Ser, Ala, Leu, He, nLeu, Val, 
nVal, Aib, Abu, Met and 6-aminohexanoyl. In another embodiment, the 
P3' residue is Gly or Ala. In another embodiment, the P3' residue is Leu, 
Abu, nLeu, nVal, CHA, hCHA, (hex}Gly, (allyl)Gly, (propargyl)Gly or 

25 {cyclopropyl)Ala. 

i. The P4' Residue 

In other embodiments herein, the peptidic substrates used in the 
conjugates provided herein include a P4' residue. P4' residues for use 
herein include, but are not limited to, Gly, Ser, Ala, Leu, lie, nLeu, Val, 
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nVal, Aib, Abu, Met, 6-anr)inohexanoyl, CHA and allylGly. In another 
embodiment, P4' residues for use herein include, but are not limited to, 
Gly, Ser, Ala, Leu, Me, nLeu, Val, nVal, Aib, Abu, Met and 6- 
aminohexanoyl. In another embodiment, the P4' residue is Gly or Ala. In 
5 another embodiment, the P4' residue is Leu, Abu, nLeu, nVal, CHA, 

hCHA, (hex}Gly, (allyl)Gly, (propargyl)Gly or (cyclopropyl)Ala. In another 
embodiment, the P4' residue is Leu. 
j. Caps 

1) X" (the N-terminal Cap) 
10 In embodiments herein where the therapeutic agent is conjugated 

to the C-terminus of the peptidic substrate {i.e., where the conjugate has 
formula I), the N-terminus of the peptidic substrate optionally is capped 
with an acyl, sulfonyl or carbamoyl derivative. The cap is chosen, in 
certain embodiments, to increase the hydrophilicity of the conjugate. In 
15 embodiments where the peptidic substrate-therapeutic agent conjugate is 
sufficiently hydrophilic so as not to require further hydrophilicity, a non- 
hydrophilic N-terminal cap, such as an acetyl group, can be used. In 
embodiments where increased hydrophilicity is desired, the N-terminal 
amino acid is modified with a hydrophilic blocking group. Such blocking 
20 groups are chosen based upon the presence of hydrophilic functionality. 
Such blocking of the terminal amino group can also reduce or eliminate 
the enzymatic degradation of such peptidyl therapeutic agents by the 
action of exogenous amino peptidases which are present in the blood 
plasma of warm blooded animals. 
25 N-Terminai blocking groups that increase the hydrophilicity of the 

conjugates and therefore increase the aqueous solubility of the 
conjugates include, but are not limited to, hydroxylated alkanoyi, 
polyhydroxylated alkanoyi, polyethylene glycol, glycosylates, sugars and 
crown ethers. 



wo 02/095007 



PCT/US02/168I9 



-92- 



In certain embodiments herein, the N-termina! blocking group is one 
of the following: 



a) 



O 



HO 




10 

or b) 




15 



where and are selected from (i) or (ii) as follows: 
(i) and are each independently: 

20 a) hydrogen; 

b) unsubstituted or substituted aryl, unsubstituted or 
substituted heterocyclyl, C3-C10 cycloalkyi, Cj-Cg 
alkenyl, C2-C6 alkynyi, halogen, CfCg perfluoroalkyi, 
R'*0-, R^CIONR^-, (R^)2NC{0)-, (R^ljN-CINR^)-, 

25 R*S(0)eNH-, -CN, R^CiO)-, -N3, -NfR^)^, or 

R*0C{0)NR3-; 

c) unsubstituted Ci-Cg alkyi; 

« 

d) substituted Ci-Cg alkyI wherein the substituent on the 
substituted C^-Cq alky I is selected from unsubstituted 

30 or substituted aryl, unsubstituted or substituted 

heterocyclyl, C3-C10 cycloalkyi, Cz-Cg alkenyl, Cs-Cg 
alkynyl, R^O-, R*S(0)^NH-, R3C(0)NR3-, (R^laNCiO)-, 
(R=')2N-C(NR=^)-, -CN, R3C(0)-, -N3, -NIR^)^, and 
R^0C(0)-NR3-; or 



wo 02/095007 



PCT/US02/16819 



-93- 



(ii) and are combined to form -(CH2)f- where one of the 

carbon atoms optionally is replaced by a moiety selected from: -0-, 
-S(0)e-. -NC(0)-, -NH- and -N(COR*)-; 

R^ is selected from: hydrogen, unsubstituted or substituted aryl, 
5 unsubstituted or substituted heterocyciyl, C,-Cg alkyi and C^-C^q 
cycloalkyi; 

R* is selected from: unsubstituted or substituted aryl, unsubstituted 
or substituted heterocyciyl, Ci-Cg alkyI and Cg-C^o cycloalkyi; 
e is 0, 1 or 2; 
10 a is 1, 2, 3 o r 4; 



15 hydrogen, OH, Ci-Cq alkyl, Ci-Cs alkoxy, C^Cq aralkyi or aryl. In these 
embodiments, a is 1, 2, 3 or 4; b is 0 or an integer between 1 and 100; 
and c is 0 to 10, provided that if b is 0, c is 1 to 10. 

In another embodiment, the N-terminal cap (X") is hydrogen, or (i), 
(ii), (hi) or (iv) as follows: 

20 (i) 



b is zero or an integer between I and 1 00; and 

c is 0 to 10, provided that if b is zero, c is 1 to 10; and 

f is 3, 4 or 5. 

In certain embodiments, R' and R^ are each independently 



O 



25 



HO 




or (ii) 



OH 
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or {iii} 

5 

or (iv) 

10 



15 where and are each independently hydrogen, C^-Cq alkyl and aryl; a 

is 1, 2, 3 or 4; a' is 0, 1,2 or 3; b is 0 or an integer between 1 and 14; 

and c is 0 or 1 , provided that if b is 0, c is 1 . 

In another embodiment, X" is R^^O-CiO)-, R^^R^^N-CiO)-, 

R^3(CH2)kC(0)- or H-C(O)-; where k is an integer from 1 to 4, or is 1 or 2; 
20 is alkyl, aryl, heteroaryl, aralkyi or heteroaralkyl; R^^ and R^^ are each 

independently hydrogen, alkyl, aryl, heteroaryl, aralkyi, or heteroaralkyl; 

and R^^ is hydrogen, hydroxy, alkyl, alkenyl, alkynyl, alkoxy, aryl, aryloxy, 

heteroaryl, heteroaryloxy, aralkyi, aralkoxy, heteroaralkyl or 

heteroaralkoxy. 

25 In certain embodiments herein, X" is hydrogen, acetyl, 

hydroxyacetyl, 2,3-dihydroxypropionyl, 2,3,4-trihydroxybutanoyl, PEG(l), 
PEG(2), PEG(4), PEG(6), PEG(14), PEG(15), PEG<16), PEG(17), PEG(18) 
or PEG(19). In other embodiments herein, X" is hydrogen, acetyl, 
hydroxyacetyl, succinyl, quinyl, gallyl, 4-imidazoiy!acetyl, cotininyl, 3- 

30 phosphonylpropionyl, gulonyl, 4-phosphonylbutyryl, glutaryl, 

ethoxysquaryl or PEG(2). In further embodiments, X" is hydrogen, acetyl, 
-C(0)NH2, HOCHjCHsCCO)-, diaminopropanoyi, or NH2-(CH2)5-C(0)-. In 
another embodiment, X" is hydrogen, acetyl, succinyl, glutaryl, PEG(2) or 
malonyl. In another embodiment, X" is hydrogen, acetyl, succinyl, 
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glutaryl, PEG(2), malony!, methoxycarbonyl, phenylsulfonyl, 3- 
methoxypropanoyl, ethoxycarbonyl, isobutoxycarbonyl, 
benzyloxycarbonyl, tert-butoxycarbonyl, 4-oxopentanoyl, 2-(2- 
methoxyethoxy)ethoxy) acetyl, 3,4-nnethylenedioxyphenylacetyI, 2- 
5 pyridylacetyl, phenoxyacetyl, phenylacetyl, methoxyacetyl, 2- 
methoxyethoxycarbonyl, 2-methoxyethoxyacetyl, 3-phenyl-2- 
hydroxypropanoyl, pent-4-ynoyl, 1-naphthylacetyI, hydroxyacetyl, 3- 
methoxycarbonylpropanoyi or formyl. 

In certain embodiments herein, the N-terminal cap (X'^) is acetyl, 

io glutaryl, or related acyl, sulfonyl or carbamoyl derivatives. Capping 

groups include, but are not limited to, a simple N-acetyl residue through 
larger fragments that impact the overall physicochemical properties of the 
conjugate. Appropriate choice of the capping group allows delivery of 
either relatively hydrophilic or hydrophobic molecules to a target site. In 

15 one embodiment, X" is acetyl. 

2) X"" (the C-terminal Cap) 
In embodiments herein where the therapeutic agent is conjugated 
to the N-terminus of the peptidic substrate {/.e., where the conjugate has 
formula II), the C-terminus of the peptidic substrate is a carboxylic acid or 

20 a carboxamide derivative. Appropriate choice of the capping group 

allows delivery of either relatively hydrophilic or hydrophobic molecules to 
a target site. 

In one embodiment, X*^, together with the carbonyi group to which 
it is attached, forms a carboxamide derivative of formula -C(0)NR''R^, 
25 where R*^ and R* are selected from (i) or (ii) as follows: 

(i) R'' and R" are each independently hydrogen, Ci-Cg-alkyl, 
-C,-C6-aIkyl-0H, -CrCg-alkyl-di-OH, -Ci-Ce-alkyl-tri-OH and 
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provided that at least one of R** and are not hydrogen or Ci-Cg-alkyt; or 
(ii) R** and together form a -CH2CH2OCH2CH2- diradical; 
b is zero or an integer between 1 and 1 00; and 
c is 0 or 1 , provided that if b is zero, c is 1 . 
5 In one embodiment, R** is hydrogen and R® is 2-hydroxyethyl. 

2. Linkers 

The conjugates optionally contain a linker {i.e., L, V or of 
formulae I, I) and III) that covalently binds the peptidic substrate to the 
therapeutic agent. The linkers are any that result in a conjugate in which 

10 the peptidic portion is a substrate for a cell surface protease and the 
therapeutic agent is substantially inactive when 'in the conjugate and is 
released in active form or in a form subsequently activated by the cell, 
tissue or environment of the targeted tissue. 

For example, the linker can include of carbohydrate, peptide, 

15 diamine, arylamine, and/or hydrocarbon core structures. Linkers are 
desirably synthetically accessible, provide shelf-stable products, and do 
not possess any intrinsic biological activity that interferes with the 
conjugates activity. They can add desirable properties such as increasing 
solubility or serving to aid in trafficking the cleaved therapeutic agent in 

20 the cell. In certain embodiments, some linkers will be enzymatically 
cleaved m vitro and in vivo, and fragment to release active therapeutic 
agent or activatable therapeutic agent. In embodiments where the 
therapeutic agent is doxorubicin, the linker is, for example, a sugar and/or 
a peptide, such the aminosugar daunosamine. 

25 In one embodiment, linkers for use herein include, but are not 

limited to, a biscarbonyl alkyi diradical whereby an amine moiety on the 
therapeutic agent is connected with the linker unit to form an amide bond 
and the amino terminus of the peptidic substrate is connected with the 
other end of the linker unit also forming an amide bond. Conversely, a 
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diaminoalkyi diradical linker unit, whereby a carbonyl moiety on the 
cytotoxic agent is covalently attached to one of the amines of the linker 
unit while the other amine of the linker unit is covalently attached to the 
C-terminus of the peptidic substrate, also can be useful. Other such linker 
5 units which are stable to the physiological environment when not in the 
presence of a cell surface protease, but are cleavable upon the cleavage 
of the cell surface protease proteolytic cleavage site, are intended for use 
herein. Furthermore, linker units can be utilized that, upon cleavage of the 
cell surface protease proteolytic cleavage site, remain attached to the 

10 therapeutic agent but do not significantly decrease the therapeutic 
activity of such a post-cleavage therapeutic agent derivative when 
compared with an unmodified therapeutic agent. 

In other embodiments, the linker is a diamine containing a cyclic 
alkyi moiety and, in certain embodiments, the diamine contains a 

15 bicycloalkylene moiety. Examples of such diamine linkers include, but are 
not limited to, 1 ,4-bis(aminomethyl)cyclohexane, 1 ,4-bis(amlnomethyl)- 
cycloheptane, 1 ,3-bis{aminomethy!)cyclopentane, 1 -amino-4-(amino- 
methyDcyctohexane, 1 ,4-diaminocyclohexane and 1 ,4-bis(aminomethyl)- 
bicycto[2.2.2]octane. 

20 Other [inkers include 1 ,a;-diaminoalkanes, including, but not limited 

to, 1,3-diaminopropane, and 1 ,a;-dicarbonylaikanes, including, but not 
limited to, oxalic, malonic, succinic, glutaric, adipic and pivalic acids. 

Further linkers for use in the conjugates provided herein include 
self-eliminating linkers such as those of the following formulae: 

25 



30 
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where A is NH or O; D is N(H or alkyi) or O; R^^ is H, alkyl, cycloalkyi, 
cycloalkylalkyL aryl, heteroaryl optionally substituted with 1 or more, 
such as 1 to 3, substituents selected from halo, halo alkyI and alkyI, 
aralkyi, heteroaralkyl, alkenyl containing 1 to 2 double bonds, alkynyl 
containing 1 to 2 triple bonds, alk(en)(yn)yl groups, halo, pseudohalo, 
cyano, hydroxy, haloalkyi and poiyhaloalkyl, such as, for example, halo 
lower alkyl, especially trifiuoromethyl, formyl, alkylcarbonyl, arylcarbonyl 
that optionally is substituted with 1 or more, such as, for example, 1 to 
3, substituents selected from, for example, halo, halo alkyl and alkyl, 
heteroarylcarbonyl, carboxy, alkoxycarbonyl, aryloxycarbonyl, 
aminoimino, alkoxycarbonylamino, aryloxycarbonylamino, aminocarbonyl, 
alkylaminocarbonyl, dialkylaminocarbonyl, arylaminocarbonyl, 
diarylaminocarbonyl, aralkylaminocarbonyl, alkoxy, aryloxy, 
perfluoroalkoxy, alkenyloxy, alkynyloxy, arylalkoxy, aminoalkyi, alkyl- 
aminoalkyl, dialkylaminoalkyi, arylaminoalkyi, amino, alkylamino, dialkyl- 
amrno, arylamlno, alkylarylamino, alkylcarbonylamino, arylcarbonylamino, 
azido, nitro, mercapto, alkylthio, arylthio, perfluoroalkylthio, thiocyano. 
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isothiocyano, alkylsulfinyi, alkylsulfonyl, arylsulfinyl, arylsulfonyl, 
aminosulfonyl, alkylaminosulfonyl, dialkylamtnosulfonyl and arylamino- 
sulfonyl.; and y is an integer from 1 to 3. 
3. Therapeutic agents 
5 The conjugates are intended for modifying a variety of biological 

responses. Accordingly, the therapeutic agents are any agents, including 
proteins and polypeptides, small molecules and other molecules that 
■possess or potentiate a desired biological activity. Such molecules 
include cytotoxic agents, such as, but are not limited to, a toxin such as 

10 abrin, ricin A, pseudomonas exotoxin, shiga toxin, diphtheria toxin and 
other such toxins and toxic portions and/or subunits or chains thereof; 
proteins such as, but not limited to, tumor necrosis factor, a-interferon, 
K-interferon, nerve growth factor, platelet derived growth factor, tissue 
plasminogen activator; or, biological response modifiers such as, for 

15 example, lymphokines, interleukin- I flL-1), interleukin-2 (iL-2), 

interleukin-6 {!L-6), granulocyte macrophage colony stimulating factor 
(GMCSF), granulocyte colony stimulating factor (G-CSF), erythropoietin 
(EPO), pro-coagulants such as tissue factor and tissue factor variants, 
pro-apoptotic agents such FAS-ligand, fibroblast growth factors (FGF), 

20 nerve growth factor and other growth factors. Each must be in a form 
that can enter a cell or otherwise exert a therapeutic effect when in the 
vicinity thereof. 

Thus, therapeutic agents, include, but are not limited to, anti- 
tumor, anti-angiogenic, pro-apoptotic, anti-cancer and anti-mitotic agents. 
25 These are conjugated, optionally via a linker, to a substrate, such as 
peptidic substrate, which is a substrate for the protease. 

Among the therapeutic agents are cytotoxic agents that include, in 
general, but are not limited to, alkylating agents, toxins, antiproliferative 
agents and tubulin binding agents. Classes of cytotoxic agents for use 
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10 



15 



20 



herein include, for example, the anthracycline family of drugs, the vinca 
drugs, the mitomycins, the bleomycins, the cytotoxic nucleosides, the 
pteridine family of drugs, diynenes, the maytansinoids, the epothilones, 
the taxanes and the podophyllotoxins. 

Exemplary members of those classes include, for example, 
doxorubicin, carminomycin, daunorubicin, aminopterin, methotrexate, 
methopterin, dichloro-methotrexate, mitomycin C, porfiromycin, 
5-fluorouracil, 6-mercapto purine, cytosine arabinoside, podophyllotoxin, 
or podophyllotoxin derivatives such as etoposide or etoposide phosphate, 
melphalan, vinblastine, vincristine, leurosidine, vindesine, leurosine, 
maytansinol, epothilone A or B, taxotere, taxol and the like. Other such 
therapeutic agents include estramustlne, cisplatin, combretastatin and 
analogs, and cyclophosphamide. One skilled in the art can make 
chemical modifications to the desired therapeutic agent in order to make 
reactions of that compound more convenient for purposes of preparing 
the conjugates. 

Particular therapeutic agents include the following drugs. One 
skilled in the art understands that these structural formulae are exemplary 
only and that such compounds or derivatives or analogs thereof have 
acquired in the art different generic or trivial names. 




R' 



30 



in which 

R^^ is amino or hydroxy; 
is hydrogen or methyl; 
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10 



is hydrogen, fluoro, chloro, bromo or iodo; 
is hydroxy or a moiety which completes a salt of the 
carboxylic acid. 

b. The mitomycin group of formula (2): 



H,N 



CHjOCONH, 




15 



20 



25 



30 



in which R^° is hydrogen or methyl. 

c. The bieomycin group of formula (3): 

CONR, NH^ 

CONH, 

II , CH3 O N— 



HO 




CH3NH X i ' i k^iTJ 

NH CH3 HO ^^3^ S 



OH Q 



O 




R 



II 



OH ^^CONH, 



in which R" is hydroxy, amino, C1-C3 alkylamino, diCCi-Cg alkyDamino. 
C^-Cg polymethylene amino, -NHCH2CH2CH2CH2NH-C{NH)NH2 or 
-NHCH2CH2CH2S^(CH3)2. 
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d. Melphalan of formula (4): 



HO.CCHCH^ ^ — NCCH^CH^Cl), 

5 NH, 




SH 



10 



N 




15 



e. Mercaptopunne of formula (5): 
NH 

> 

N 

f. Cyotosine arabinoside of formula (6): 



20 




HOH^C. n 



N . 



25 



/ \ 
HO OH 



g. Podophyllotoxins of formula (7): 



30 



35 



40 
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in which 

R^^ is hydrogen or methyl; and 

R^'* is methyl or thienyl or a phosphate salt thereof. 

h. The vinca alkaloid group of drugs of formula (8): 





CO2CH3 

20 

in which 

when R^^ and R^^ are taken singly, R'^ is H, CH3 or CHO; and 
R'^ is and one of R^^ and R^' is ethyl and the other is H 
or OH; 

25 when R^' and R^^ are taken together with the carbons to which 

they are attached, they form an oxirane ring in which case R^^ is ethyl; 
and 

R^^ is hydrogen, (C1-C3 alkyl)-CO, or chlorosubstituted 
(C1-C3 alkyI)-CO. 

30 The conjugates provided herein where the therapeutic agent Is the 

vinca alkaloid vinblastine include those of formula: 



35 
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5 



10 




L - peptide - X" 



where the peptidic substrate is as described above for formulae I and II; L 
Is a linker such as -NH-{CH2)u-T-(CH2)^-NH-; X" is 
20 a) hydrogen, 

b) -(C = 0)Ri% 

c) 



25 



HO 




30 



35 



40 



d) 



e) 



O 



o 



HO 



o o 



f) 



ethoxysquarate; and 
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g) 



cotininyl; 



and are independently hydrogen, OH, C^~Cq alkyi, Ci-Cg 
alkoxy, C^-Cq aralkyi and aryl; 

R^^ is Ci-Cg-alkyl, hydroxylated Cg-Cg-cycloalkyI, poiyhydroxylated 
5 Cg-Ca-cycloalkyl, hydroxylated aryl, poiyhydroxylated aryl or aryl, 

R^^ is hydrogen, (C1-C3 alkyl)-CO, or chlorosubstituted (C1-C3 
alkyi)-CO; 

T is selected from cyclopentyl, cyclohexyl, cycloheptyl or 
bicyclo[2.2.23octanyl; 
0 a is 1 , 2, 3 or 4; 



5 



b is zero or an integer between 1 and 1 00; 
c is 0 or 1 , provided that if b is zero, c is 1 ; 
g is 1 , 2 or 3; 
u is 0, 1 , 2 or 3; 

or a pharmaceutically acceptable derivative thereof. 



■ 

I. 



Difluoronudeosides of formula (9): 



R 



21 




F — 



\ 



F 



OH 



in which R 



is a base of one of the formulae: 
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O 



HN 




O 



R 



23 



R 



22 



HN 




N 



> 




N 



CH=CHR 



24 



10 



NH. 



15 



N 




R 



22 




20 



R^^ is hydrogen, methyl, bromo, fluoro, chioro or iodo; 
R" is -OH or -NH2; 
R^'^is hydrogen, bromo, chioro or iodo. 
j. Estramustine (10): 

OH 



25 




k. Cyclophosphamide (11): 



35 



N(CH,CH,C1)3 
.NH 



wo 02/095007 



PCT/IIS02/168I9 



107 



10 



Anthracycline antibiotics of formula (12); 
OH O 




15 in which 

R« is -CH3, -CH2OH, -CH20CO{CH2)3CH3, or 
-CH20COCH(OC2Hs)2; 

R'* is -OCH3, -OH or -H; 

R*" is -NH2, -NHCOCF3, 4-morpholinyl, 3-cyano-4-morpholinyl, 
20 1-pjperidinyl, 4-methoxy-1 -piperidinyl, benzylamine, dibenzylamine, 
cyanomethylamine, or 1 -cyano-2-methoxyethyl amine; 
is -OH -OTHP or -H; and 
R^ is -OH or -H provided that R6 is not -OH when R^ is rOH or 
-OTHP. 

25 Table 2, which follows, provides a number of anthracycline drugs 

and their generic or trivial names: 



30 



35 




Compound 



R' 



R-^ 
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10 



15 



20 



25 



daunorubicin" 


CH3 


OCH3 


NH2 


OH 


H 


doxorubicin'* 


CH2OH 


OCHa 


NH2 


OH 


H 


detorubicin 


CH20COCH{OC2Hs)2 


OCH3 


NH2 


OH 


H 


carminomycin 


CH3 


OH 


NH2 


OH 


H 


idarubicin 


CH3 


H 


NH2 


OH 


H 


epirubicin 


CHjOH 


OCH3 


NH2 


OH 


OH 


esorubicin 


CH2OH 


OCH3 


NH2 


H 


H 


THP 


CHjOH 


OCH3 


NH2 


OTHP 


H 


AD-32 


CH20C0(CH2)3CH3 


OCH3 


NHCOCF3 


OH 


H 



• daunorubicin is an alternative nanne for daunomycin 
doxorubicin is an alternative name for adriamycin. 

In one embodiment, when the therapeutic agent is doxorubicin, it is 

conjugated to the peptidic substrate via the amino group of the 

aminoglycoside moiety of doxorubicin. 

m. Maytansinol 

Me 




Mei''' 



OMe 



30 



n. 



Epothilone A or B 



35 



40 
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lo 



15 



20 




,,tilVle 
Me. Me"^ OH 

Me 




O OH O 
A 

Me 




25 



30 



35 



o. Taxols 




40 



where R is PhC(O) or t-BuOC(O). 

In one embodiment, when the therapeutic agent is taxol (R = 
C(O)Ph), the peptidic substrate is conjugated to the secondary hydroxyl 
group of the cyclohexane moiety of taxol. 
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p. Ribosome-inactivating proteins 

Ribosome-inactivating proteins (RIPs), which include ricin, abrin and 
saporin, are plant proteins that catalytically inactivate eukaryotic 
ribosomes. RIPS inactivate ribosonnes by interfering with the protein 
5 elongation step of protein synthesis. For example, the RIP saporin 
(hereinafter also referred to as SAP) has been shown to enzymatically 
inactivate 60S ribosomes by cleavage of the n-glycosidic bond of the 
adenine at position 4324 in the rat 285 ribosomal RNA (rRNA). Some 
RIPs, such as the toxins abrin and ricin, contain two constituent chains: 

10 a cell-binding chain that mediates binding to cell surface receptors and 
internalization of the molecule; and an enzymatically active chain 
responsible for protein synthesis inhibitory activity. Such RIPs are type H 
RIPs. Other RIPs, such as the saporins, are single chains and are 
designated type I RIPs. Because such RIPs lack a cell-binding chain, they 

15 are less toxic to whole cells than the RIPs that have two chains. Two 
chain RIPs are generally used for conjugation herein, unless a single chain 
is further conjugated to an agent, such as a growth factor that mediates 
binding and internalization. 

Several structurally related RIP's have been isolated from seeds and 

20 leaves of the plant Saponaha officinalis (soap wort). Among these, SAP-6 
is the most active and abundant, representing 7% of total seed proteins. 
Saporin is very stable, has a high isoelectric point, does not contain 
carbohydrates, and is resistant to denaturing agents, such as sodium 
dodecyl sulfate (SDS), and a variety of proteases. The amino acid 

25 sequences of several saporin-6 isoforms from seeds are known and there 
appear to be families of saporin RIPs differing in a few amino acid 
residues. Because saporin is a type I RIP, it does not possess a cell- 
binding chain. Consequently, its toxicity to whole cells is much lower 
than the other toxins, such as ricin and abrin. When internalized by 
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eukaryotic cells, however, its cytotoxicity is 100- to 1000-fold more 
potent than rtcin A chain. 

4. Exemplary Conjugates 

The conjugates provided herein, are prepared by identifying suitable 
5 peptidic substrates for the targeted cell surface protease, or a soluble, 
shed or released form thereof, and forming a conjugate of the peptidic 
substrate(s) with a therapeutic agent(s)- Exemplary conjugates, 
containing peptidic substrates designed, for example, for cleavage by 
MTSPl, endotheliase 1 and urokinase, are described- It is understood 

10 that upon identification of a cell surface protease, including cell- 
associated and cell-localized proteases, or a soluble, shed or released 
form thereof, in or associated with a cell involved in a disease or other 
conditions of interest, or with a cell present in the vicinity of a celt or 
tissue involved in or associated with a disease or other condition of 

15 interest, suitable peptidic substrates therefor can be emplricaily designed 
and then conjugated to therapeutic agents as exemplified herein. 

In certain embodiments, the conjugates for use in the compositions 
and methods provided herein include: 

Ac-Leu-Arg-Ala-Quat-Gly-Arg-Ala-A!a-{therapeutic agent) (SEQ ID NO: 
20 46); 

Ac-Leu-Arg-Ala-Quat-Ala-Arg-Ala-Ala-{therapeutic agent) (SEQ ID NO: 

47) ; 

Ac-Leu-Arg-Ser-Quat-Gly-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 

48) ; 

25 Ac-Leu-Arg-Ser-Quat-Ala-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 

49) ; 

Ac-Leu-Arg-Pro-Arg-Phe-Lys-lle-lle-{therapeutic agent) (SEQ ID NO: 50); 
Ac-Arg-Pro-Arg-Phe-Lys-lle-lle-(therapeutic agent) (SEQ ID NO: 51); 
Ac-Pro-Arg-Phe-Lys-lle-lle-{therapeutic agent) (SEQ ID NO: 52); 
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Ac-Leu-Arg-Ser-Lys-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 53); 
Ac-Arg-Ser-Lys-Ser-Arg-Ala-Ala-(therapeutic agent} (SEQ ID NO: 54); 
Ac-Ser-Lys-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 55); 
Ac-Leu-Arg-Pro-Arg-Phe-Arg-lle-lle-(therapeutic agent) (SEQ ID NO: 56); 
5 Ac-Arg-Pro-Arg-Phe-Arg-lle-lle-(therapeutic agent) (SEQ ID NO: 57); 
Ac-Pro-Arg-Phe-Arg-lle-IIe-(therapeutic agent) (SEQ ID NO: 58); 
Ac-Leu-Arg-Ser-Arg-Ser-Arg-Ala-Afa-(therapeutic agent) (SEQ ID NO: 59); 
Ac-Arg-Ser-Arg-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 60); and 
Ac-Ser-Arg-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 61). 

10 in further embodiments herein, the conjugates are Ac-Leu-Arg-Ala- 

Quat-Giy-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 62); Ac-Leu-Arg- 
Ala-Quat-Ala-Arg-Ata-Ala-(therapeutic agent) (SEQ ID NO: 63); Ac-Leu- 
Arg-Ser-Quat-Gly-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 64); and 
Ac-Leu-Arg-Ser-Quat-Ala-Arg-AIa-Ala-(therapeutic agent) (SEQ ID NO: 

15 65). 

In other embodiments herein, the conjugates are 

Ac-Leu-Arg-Pro-Arg-Phe-Lys-lle-lle-(therapeutic agent) (SEQ ID NO: 66); 

Ac-Arg-Pro-Arg-Phe-Lys-lle-lle-(therapeutic agent) (SEQ ID NO: 67); 

Ac-Pro-Arg-Phe-Lys-lle-Ile-(therapeutic agent) (SEQ ID NO: 68); 
20 Ac-Leu-Arg-Ser-Lvs-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 69); 

Ac-Arg-Ser-Lys-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 70); 

Ac-Ser-Lys-Ser-Arg-Ala-Aia-(therapeutic agent) (SEQ ID NO: 71); 

Ac-Leu-Arg-Pro-Arg-Phe-Arg-lle-IIe-(therapeutic agent) (SEQ ID NO: 72); 

Ac-Arg-Pro-Arg-Phe-Arg-lle-lle-(therapeutic agent) (SEQ ID NO: 73); 
25 Ac-Pro-Arg-Phe-Arg-lle-lle-(therapeutrc agent) (SEQ ID NO: 74); 

Ac-Leu-Arg-Ser-Arg-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 75); 

Ac-Arg-Ser-Arg-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 76); and 

Ac-Ser-Arg-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 77). 
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In other embodiments, the conjugates for use herein include the 
following: 

pyroGiu-Pro-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 78); 
CHaSOj-D-HHT-Gly-Arg-Ala-Ala-Kherapeutic agent) (SEQ ID NO: 79); 
5 N-p-tosyl-Gly-Pro-Arg-Ala-AIa-{therapeutic agent) {SEQ ID NO: 80); 
Benzoyl-Val-Gly-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 81); 
CH3S02-D-HHT-GIy-Arg-Ala-Ala-{therapeut(c agent) (SEQ ID NO: 82); 
N-GT-Z-D-Arg-Giy-Arg-Aia-Ala-ltherapeutic agent) (SEQ ID NO: 83) (Z = 
benzyloxycarbonyl); 
10 pyroGlu-Gly-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 84); 
H-D-l!e-Pro-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 85); 
Cbo-L-(/)G!u(c7-t-BuO)-Gly-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 
86) (Cbo = carbobenzoxy); 

H-D-Pro-Phe-Arg-AIa-Ala-(therapeutic agent) (SEQ ID NO: 87); 
15 H-D-Val-Leu-Arg-A!a-Ala-(therapeutic agent) (SEQ ID NO: 88); 

Bz-lle-Glu(K-OH)-Gly-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 89) (Bz 
= benzoyl); 

Bz-lle-Glu(K-OMe)-Gly-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 90); 

Benzoyl-Pro-Phe-Arg-Ala-Ala-(therapeutic agent) (SEQ' ID NO: 91); 
20 H-D-Phe-Pip-Arg-Ala-Ala-(therapeutic agent) fSEQ ID NO: 92); 

H-D-Val-Leu-Lys-Ala-Ala-(therapeutic agent) (SEQ ID NO: 93); 

H-D-Nle-HHT-Lys-Ala-Ala-(therapeutic agent) (SEQ ID NO: 94); 

Pyr-Arg-Thr-Lys-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 95); 

H-Arg-Gln-Arg-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 96); 
25 Boc-Gln-G!y-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 97); 

Z-Arg-Arg-Ala-AIa-(therapeutic agent) (SEQ ID NO: 98); 

H-D-HHT-Ala-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 99); 

H-D-CHT-Gfy-Arg-Ala-A!a-(therapeutic agent) (SEQ ID NO: 100); 

MeS02-dPhe-Pro-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 101); 
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<J-Z-D-Lys-Pro-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 102); and 
CHaSOj-D-CHA-But-Arg-Ala-Ala-ltherapeutic agent) (SEQ ID NO: 103). 

In another embodinnent, the conjugates for use in the compositions 
and methods provided herein include: 
5 Ac-Arg-Gln-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 104); 
Ac-Arg-Arg-Gln-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 105); 
Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 
106); 

Ac-Arg-Gln-Ser-Arg-Ala-(therapeutic agent) (SEQ ID NO: 107); 
10 AG-Arg-Arg-Gln-Ser-Arg-Ala-(therapeutic agent) (SEQ ID NO: 108); 
Ac-Leu-Arg-Arg-Gln-Ser-Arg-Gly-Gly-(therapeutic agent) (SEQ ID NO: 
109); 

Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ala-(therapeutic agent) (SEQ ID NO: 110); 
Ac-Arg-Arg-Gln-Ser-Arg-lle-(therapeutic agent) (SEQ ID NO: 111); and 
15 Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ala-lle-(therapeutic agent) (SEQ ID NO: 112). 

In certain embodiments, the conjugates for use in the compositions 
and methods provided herein include: 

Ac-Leu-Arg-Ala-Quat-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

113) ; 

20 Ac-Leu-Arg-Ala-Quat-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

114) ; 

Ac-Leu-Arg-Ser-Quat-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

115) ; 

Ac-Leu-Arg-Ser-Quat-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
25 116); 

Ac-Leu-Arg-Pro-Arg-Phe-Lys-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
117); 

Ac-Arg-Pro-Arg-Phe-Lys-Ser-Leu-(therapeutic agent) (SEQ ID NO: 118); 
Ac-Pro-Arg-Phe-Lys-Ser-Leu-(therapeutic agent) (SEQ ID NO: 119); 
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Ac-Leu-Arg-Ser-Lys-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
120); 

Ac-Arg-Ser-Lys-Ser-Arg-Ser-Leu-{therapeutrc agent) (SEQ ID NO: 121); 
Ac-Ser-Lys-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 122); 
5 Ac-Leu-Arg-Pro-Arg-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
123); 

Ac-Arg-Pro-Arg-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 124); 
Ac-Pro-Arg-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 125); 
Ac-Leu-Arg-Ser-Arg-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
10 126); 

Ac-Arg-Ser-Arg-Ser-Arg-Ser-Leu-(therapeutic agent) {SEQ ID NO: 127); 
and 

Ac-Ser-Arg-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 128). 

In further embodiments herein, the conjugates are Ac-Leu-Arg-AIa- 
15 Quat-GIy-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 129); Ac-Leu-Arg- 
Afa-Quat-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 130); Ac-Leu- 
Arg-Ser-Quat-GIy-Arg-Ser-Leu-{therapeutic agent) (SEQ ID NO: 131); and 
Ac-Leu-Arg-Ser-Quat-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

132) . 

20 In other embodiments herein, the conjugates are 

Ac-Leu-Arg-Pro-Arg-Phe-Lys-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

133) ; 

Ac-Arg-Pro-Arg-Phe-Lys-Ser-Leu-(therapeutrc agent) (SEQ ID NO: 134); 
Ac-Pro-Arg-Phe-Lys-Ser-Leu-(therapeutic agent) (SEQ ID NO: 135); 
25 Ac-Leu-Arg-Ser-Lys-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
136); 

Ac-Arg-Ser-Lys-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 137); 
Ac-Ser-Lys-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 138); 



wo 02/095007 



PCT/US02/16819 



-116- 



Ac-Leu-Arg-Pro-Arg-Phe-Arg-Ser-Leu-(therapeutic agent} (SEQ ID NO: 
139); 

Ac-Arg-Pro-Arg-Phe-Arg-Ser-Leu-(therapeutic agent) {SEQ ID NO: 140); 
Ac-Pro-Arg-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 141); 
5 Ac-Leu-Arg-Ser-Arg-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
142); 

Ac-Arg-Ser-Arg-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 143); 
and 

Ac-Ser-Arg-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 144). 
10 In other ennbodiments, the conjugates for use herein Include the 

following: 

pyroGlu-Pro-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 145); 
CH3S02-D-HHT-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 146); 
N-p-tosyl-Gly-Pro-Arg-Ser-Leu-{therapeutic agent) (SEQ ID NO: 147); 
15 Ben2oyI-Val-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 148); 
CH3S02-D-HHT-Gly-Arg-Ser-Leu-(therapeutlc agent) (SEQ ID NO: 149); 
N-ar-Z-D-Arg-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 150) (2 = 
benzyloxycarbonyl); 

pyroGlu-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 151); 
20 H-D-lle-Pro-Arg-Ser-Leu-(therapeutlc agent) (SEQ ID NO: 152); 

Cbo-L-(K)Glu(a-t-BuO)-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
153) (Cbo = carbobenzoxy); 

H-D-Pro-Phe-Arg-Ser-Leu-(therapeutic agent) {SEQ ID NO: 154); 
H-D-Val-Leu-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 155); 
25 Bz-lle-GIu{K-OH)-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 156) 
(Bz = benzoyl); 

Bz-lle-Glu(K-OMe)-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 157); 
Benzoyl-Pro-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 158); 
H-D-Phe-Pip-Arg-Ser-Leu-(therapeutlc agent) (SEQ ID NO: 159); 
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H-D-Val-Leu-Lys-Ser-Leu-ltherapeutrc agent) (SEQ ID NO: 160); 
H-D-Nle-HHT-Lys-Ser-Leu-{therapeutic agent) (SEQ ID NO: 161); 
Pyr-Arg-Thr-Lys-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 162); 
H-Arg-Gln-Arg-Arg-Ser-Leu-{therapeuttc agent) (SEQ ID NO: 163); 
5 Boc-Gln-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 164); 
Z-Arg-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 165); 
H-D-HHT-Ala-Arg-Ser-Leu-(t)ierapeutic agent) (SEQ ID NO: 166); 
H-D-CHT-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 167); 
MeSOj-dPhe-Pro-Arg-Ser-Leu-ltherapeutic agent) (SEQ ID NO: 168); 
10 ^-Z-D-Lys-Pro-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 169); and 
CHgSOj-D-CHA-But-Arg-Ser-Leu-itherapeutic agent) (SEQ ID NO: 170). 

In another embodiment, the conjugates for use in the compositions 
and methods provided herein include: 

Ac-Arg-Gln-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 171); 
15 Ac-Arg-Arg-Gln-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 172); 
Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
173); 

Ac-Arg-Gln-Ser-Arg-Leu-(therapeutic agent) (SEQ ID NO: 174); 
Ac-Arg-Arg-Gln-Ser-Arg-Leu-(therapeutic agent) (SEQ ID NO: 175); 
20 Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
176); 

Ac-Leu-Arg-Arg-Gln-Ser-Arg-Leu-(therapeutic agent) (SEQ ID NO: 177); 
Ac-Arg-Arg-Gln-Ser-Arg-Leu-(therapeutic agent) (SEQ ID NO: 178); and 
Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
25 179). 

In other embodiments, the conjugates provided herein include: 
Ac-Arg-Gln-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 180); 
Ac-Arg-Gln-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 181); 
Ac-Arg-Gln-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 182); 
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Ac-Arg-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 183); 
Ac-Arg-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 184); 
Ac-Arg-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 185); 
Ac-Arg-Phe-Arg-Ser-Leu-{therapeutic agent) (SEQ ID NO: 186); 
5 Ac-Gln-Ser-Arg-Ser-Leu-itherapeutic agent) (SEQ ID NO: 187); 
Ac-Gln-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 188); 
Ac-Gln-Ala-Arg-Ser-Leu- (therapeutic agent) (SEQ ID NO: 189); and 
Ac-Gln-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 190). 

In further embodiments, the conjugates for use in the compositions 
10 and methods provided herein include: 

Ac-Leu-Arg-Ala-Quat-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ (D NO: 

191) ; 

Ac-Leu-Arg-Ala-Quat-Ala-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

192) ; 

15 Ac-Leu-Arg-Ser-Quat-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

1 93) ; 

Ac-Leu-Arg-Ser-Quat-Ala-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

194) ; 

Ac-Leu-Arg-Pro-Arg-Phe-Lys-Ser-Ser-Leu-(therapeutic agent) {SEQ ID NO: 
20 195); 

Ac-Arg-Pro-Arg-Phe-Lys-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
196); 

Ac-Pro-Arg-Phe-Lys-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 197); 

Ac-Leu-Arg-Ser-Lys-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
25 198); 

Ac-Arg-Ser-Lys-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
199); 

Ac-Ser-Lys-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 200); 
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Ac-Leu-Arg-Pro-Arg-Phe-Arg-Ser-Ser-Leu-(therapeLJtic agent) (SEQ ID NO: 

201) ; 

Ac-Arg-Pro-Arg-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

202) ; 

5 Ac-Pro-Arg-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 203); 
Ac-Leu-Arg-Ser-Arg-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

204) ; 

Ac-Arg-Ser-Arg-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

205) ; and 

10 Ac-Ser-Arg-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 206). 

In further embodiments herein, the conjugates are Ac-Leu-Arg-Ala- 
Quat-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 207); Ac-Leu- 
Arg-Ala-Quat-Ala-Arg-Ser-Ser-Leu-{therapeutic agent) (SEQ ID NO: 208); 
Ac-Leu-Arg-Ser-Quat-Gly-Arg-Ser-Ser-Leu-(therapeutlc agent) (SEQ ID NO: 
15 209); and Ac-Leu-Arg-Ser-Quat-Ala-Arg-Ser-Ser-Leu-(therapeutlc agent) 
(SEQ ID NO: 210). 

in other embodiments herein, the conjugates are 

Ac-Leu-Arg-Pro-Arg-Phe-Lys-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

211) ; 

20 Ac-Arg-Pro-Arg-Phe-Lys-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

212) ; 

Ac-Pro-Arg-Phe-Lys-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 213); 

Ac-Leu-Arg-Ser-Lys-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

214) ; 

25 Ac-Arg-Ser-Lys-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

215) ; 

Ac-Ser-Lys-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 216); 

Ac-Leu-Arg-Pro-Arg-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
217); 
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Ac-Arg-Pro-Arg-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID MO: 
218); 

Ac-Pro-Arg-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 219); 
Ac-Leu-Arg-Ser-Arg-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
5 220); 

Ac-Arg-Ser-Arg-Ser-Arg-Ser-Ser-Leu-{therapeutic agent) (SEQ ID NO: 
221); and 

Ac-Ser-Arg-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 222). 
In other embodiments, the conjugates for use herein include the 
10 following: 

pyroGlu-Pro-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 223); 
CH3S02-D-HHT-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
224); 

N-p-tosyl-G(y-Pro-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 225); 
15 Ben2oyI-Val-Gly-Arg-Ser-Ser-Leu-{therapeutlc agent) (SEQ ID NO: 226); 
CHgSOz-D-HHT-Gly-Arg-Ser-Ser-Leu-dherapeutic agent) (SEQ ID NO: 
227); 

N-a-Z-D-Arg-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 228) (2 
= benzyloxycarbonyl); 
20 pyroGlu-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 229); 
H-D-Ile-Pro-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 230); 
Cbo-L-(K)Glu(a-t-BuO)-GIy-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID 
NO: 231) (Cbo = carbobenzoxy); 

H-D-Pro-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 232); 
25 H-D-Val-Leu-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 233); 
Bz-lle-Glu(K-OH)-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

234) (Bz = benzoyl); 

Bz-lle-Glu(K-OMe)-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

235) ; 
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Benzoyl-Pro-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 236); 
H-D-Phe-Pip-Arg-Ser-Ser-Leu-(therapeutrc agent) (SEQ ID NO: 237); 
H-D-VaI-Leu-Lys-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 238); 
H-D-Nle-HHT-Lys-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 239); 
5 Pyr-Arg-Thr-Lys-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 240); 
H-Arg-Gln-Arg-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 241); 
Boc-Gln-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 242); 
Z-Arg-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 243); 
H-D-HHT-Ala-Arg-Ser-Ser-Leu- (therapeutic agent) (SEQ ID NO: 244); 
10 H-D-CHT-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 245); 

MeS02-dPhe-Pro-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 246); 
<y-Z-D-Lys-Pro-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 247); and 
CH3S02-D-CHA-But-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
248). 

15 In another embodiment, the conjugates for use in the compositions 

and methods provided herein include: 

Ac-Arg-Gln-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 249); 
Ac-Arg-Arg-GIn-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

250) ; 

20 Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

251) ; 

Ac-Arg-Gln-Ser-Arg-Leu-(therapeutjc agent) (SEQ ID NO: 252); 
Ac-Arg-Arg-Gln-Ser-Arg-Leu-(therapeutic agent) (SEQ ID NO: 253); 
Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
25 254); 

Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
255); 

Ac-Arg-Arg-Gln-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 256); 
and 
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Ac-Leu-Arg-Arg-Gin-Ser-Arg-Ser-Ser-Leu-{therapeutic agent) (SEQ ID NO: 
257). 

In other embodiments, the conjugates provided herein include: 

Ac-Arg-Gln-Gly-Arg-Ser-S€r-Leu-(therapeutic agent) (SEQ ID NO: 258); 
B Ac-Arg-Gln-Aia-Arg-Ser-Ser-Leu-{therapeutic agent) (SEQ ID NO: 259); 

Ac-Arg-Gln-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 260); 

AG-Arg-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 261); 

Ac-Arg-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 262); 

Ac-Arg-Ala-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 263); 
10 Ac-Arg-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 264); 

Ac-Gln-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 265); 

Ac-Gln-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 266); 

Ac-Gln-AIa-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 267); and 

Ac-Gln-Phe-Arg-Ser-Ser-Leu-(therapeutiG agent) (SEQ ID NO: 268). 
15 In another embodiment, the conjugates provided herein include: 

Ac-Gly-dSer-Ala-Arg-Ser-Ala-(therapeutic agent) (SEQ ID NO: 569); 

Ac-Arg-Gly-dSer-Ala-Arg-Ser-Ala-(therapeutic agent) (SEQ ID NO: 570); 

Ac-Gly-Ser-GIy-Arg-Ser-AIa-(therapetutic agent) (SEQ ID NO: 571); 

Ac-Arg-Gly-Ser-Gly-Arg-Ser-Ala-(therapetutic agent) (SEQ ID NO: 572); 
20 Ac-Leu-Arg-Gly-Ser-Gly-Arg-Ser-Ala-(therapetutic agent) (SEQ ID NO: 

573) ; 

Ac-Leu-Arg-Gly-dSer-Ala-Arg-Ser-Ala-(therapetutic agent) (SEQ ID NO: 

574) ; 

Ac-Cys(Me)-Pro-Gly-Arg-Val-Val-(therapeutic agent) (SEQ ID NO: 575); 
25 Ac-Arg-Cvs(Me)-Pro-Gly-Arg-Val-Val-(therapeutic agent) (SEQ ID NO: 

577) ; 

Ac-Arg-Arg-Cys(Me)-Pro-G)y-Arg-VaI-Val-(therapeutic agent) (SEQ ID NO: 

578) ; 

Ac-Val-Ser-Ala-Arg-Met-Ala-(therapeutic agent) (SEQ ID NO: 579); 
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Ac-Ile-Val-Ser-AIa-Arg-Met-Ala-(therapeutic agent) (SEQ ID NO: 580); 
Ac-Val-IIe-\/al-Ser-Ala-Arg-Met-Ala-{therapeutic agent) (SEQ ID NO: 581); 
Ac-Val-lle-Val-Ser-Ala-Arg-nLeu-Aia-(therapeutic agent) {SEQ ID NO: 
582); 

5 Ac-VaI-Ser-Ala-Arg-nLeu-Ala-(therapeutic agent)- (SEQ ID NO: 583); 

Ac-ile-Val-Ser-A!a-Arg-nLeu-Ala-(therapeutic agent) (SEQ ID NO: 584); 

Ac-Gly-Ser-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 585); 

Ac-Gly-Ser-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 586); 

Ac-Gly-Ser-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 587); 
10 Ac-Ser-Gly-Arg-Ser-Leu-{therapeutic agent) (SEQ ID NO: 588); 

Ac-Ser-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 589); 

Ac-Ser-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 590); 

Ac-Arg-Gly-Ser-GIy-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 591); 

Ac-Arg-Gly-Ser-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
15 592); 

Ac-Arg-Gly-Ser-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 593); 
Ac-Leu-Arg-Gly-Ser-Gly-Arg-Ser-Leu-{therapeutic agent) (SEQ ID NO: 
594); 

Ac-Leu-Arg-Gly-Ser-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
20 595); and 

Ac-Leu-Arg-Gly-Ser-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
596). 

In another embodiment, the conjugates provided herein are 
selected from: 

25 Ac-R-Q-G-R-S-L-(therapeutic agent) (SEQ ID NO: 491); 
Ac-R-Q-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 492); 
Ac-R-Q-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 493); 
Ac-R-Q-G-R-S-nV-(therapeutic agent) (SEQ ID NO: 494); 
Ac-R-Q-G-R-S-F-(therapeutic agent) (SEQ ID NO: 495); 
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Ac-R-Q-G-R-A-L-(therapeutic agent) (SEQ ID NO: 496); 

Ac-R-Q-G-R-A-L-{therapeutic agent) (SEQ ID NO: 497); 

Ac-R-Q-G-R-A-nL-(therapeutic agent) (SEC ID NO: 498); 

Ac-R-Q-G-R-A-nL-(therapeutic agent) {SEQ ID NO: 499); 
5 Ac-R-Q-G-R-A-nV-{therapeutic agent) (SEQ ID NO: 500); 

Ac-R-Q-G-R-A-Cha-(therapeutic agent) (SEQ ID NO: 501); 

Ac-R-Q-G-R-A-F-(therapeutic agent) (SEQ ID NO: 502); 

Ac-R-N-G-R-S-L-(therapeutic agent) (SEQ ID NO: 503); 

Ac-R-N-G-R-A-nL-(therapeutic agent) (SEQ ID NO: 504); 
10 Ac-R-Q-A-R-S-L-(therapeutic agent) (SEQ ID NO: 505); 

Ac-R-Q-A-R-S-nL-(therapeutic agent) (SEQ ID NO: 506); 

Ac-R-Q-A-R-S-nV-(therapeutic agent) (SEQ ID NO: 507); 

Ac-R-Q-A-A-S-Cha-(therapeutic agent) (SEQ ID NO: 508); 

Ac-R-Q-A-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 509); 
15 Ac-R-Q-A-R-T-nL-(therapeutic agent) (SEQ ID NO: 510); 

Ac-R-Q-A-R-A-L-(therapeutic agent) (SEQ ID NO: 511); 

Ac-R-Q-A-R-A-nL-(therapeutic agent) (SEQ ID NO: 512); 

Ac-R-Q-A-R-A-nV-(therapeutic agent) (SEQ ID NO: 513); 

Ac-R-Q-A-R-A-Cha-(therapeutic agent) (SEQ ID NO: 514); 
20 Ac-R-Q-S-R-A-A-(therapeutic agent) (SEQ ID NO: 515); 

Ac-R-Q-S-R-A-(therapeutic agent) (SEQ ID NO: 516); 

Ac-R-Q-S-R-A-nL-(therapeutic agent) (SEQ ID NO: 517); 

Ac-R-Q-S-R-A-L-(therapeutic agent) (SEQ ID NO: 518); 

Ac-R-Q-S-R-A-nV-(therapeutic agent) (SEQ ID NO: 519); 
25 Ac-R-Q-S-R-A-Cha-(therapeutic agent) (SEQ ID NO: 520); 

Ac-R-Q-S-R-S-S-L-(therapeutic agent) (SEQ ID NO: 521); 

Ac-R-Q-S-R-S-L-(therapeutic agent) (SEQ ID NO: 522); 
Ac-R-Q-S-R-S-nL-(therapeutic agent) (SEQ ID NO: 523); 
Ac-R-Q-S-R-S-nL-{therapeutic agent) (SEQ ID NO: 524); 
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Ac-R-Q-S-R-S-nV-(therapeutic agent) {SEQ \D NO: 525); 

Ac-R-Q-S-R-S-allylG-(therapeutic agent) (SEQ ID NO: 526); 

Ac-R-Q-S-R-S-Cha-(therapeutic agent) (SEQ ID NO: 527); 

Ac-R-Q-S-R-T-nL-(therapeutic agent) (SEQ ID NO: 528); 
5 Ac-R-a-T-R-S-S-L-(therapeutic agent) (SEQ ID NO: 529); 

Ac-R-Q-T-R-S-L-(therapeutic agent) (SEQ ID NO: 530); 

Ac-R-N-S-R-S-nL-(t)ierapeutic agent) (SEQ ID NO: 531); 

Ac-R-a-F-R-S-L-(therapeutic agent) (SEQ ID NO: 532); 

Ac-R-Q-F-R-S-nL-(therapeutic agent) (SEQ ID NO: 534); 
10 Ac-R-Q-F-R-S-nV- (therapeutic agent) (SEQ ID NO: 535); 

Ac-R-Q-F-R-S-nL-(therapeutic agent) (SEQ ID NO: 536); 

Ac-R-a-F-R-S-Cha-(therapeutic agent) (SEQ ID NO: 537); 

Ac-R-Q-F-R-A-L-(therapeutic agent) (SEQ ID NO: 538); 

Ac-R-Q-F-R-A-nL-(therapeutlc agent) (SEQ ID NO: 539); 
15 Ac-R-Q-F-R-A-nV-(therapeutic agent) (SEQ ID NO: 540); 

Ac-R-Q-F-R-A-Cha-(therapeutic agent) (SEQ ID NO: 541); 

Ac-Q-S-R-S-S-nL-(therapeutic agent) (SEQ ID NO: 542); 

MeOCO-Quat2-G-R-S-L-(therapeutic agent) (SEQ ID NO: 483); 

MeOCO-Quat3-G-R-S-L-(therapeutic agent) (SEQ ID NO: 484); 
20 MeOCO-Quat-G-R-S-L-(therapeutic agent) (SEQ ID NO: 485); 

MeOCO-Quat4-G-R-S-L-(therapeutic agent) (SEQ ID NO: 486); 

MeOCO-Quat5-G-R-S-L-(therapeutic agent) (SEQ ID NO: 487); 

MeOCO-Quat2-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 488); 

MeOCO-Quat4-G-R-S-L-(therapeutic agent) (SEQ ID NO: 489); 
25 MeOCO-Quat2-G-R-S-L-(therapeutic agent) (SEQ ID NO: 490); 

Ac-a-G-R-S-L-(therapeutic agent) (SEQ ID NO: 445); 

Ac-Q-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 446); 

Ac-Q-G-R-A-S-L-(therapeutic agent) (SEQ ID NO: 447); 

Ac-N-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 448); 
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Ac-Q-G-R-S-S-nL-(therapeutic agent) (SEQ ID NO: 449); 

Ac-Q-G-R-S-S-nV-(therapeutic agent) iSEQ ID NO: 450); 

Ac-a-G-R-S-S-Cha-{therapeutic agent) {SEQ ID NO: 451); 

Ac-Q-G-R-S-S-allyIG-(therapeutic agent) (SEQ ID NO: 452); 
5 Ac-Q-G-R-S-S-allylG-(therapeutic agent) (SEQ ID NO: 453); 

Ac-Q-A-R-S-L-(therapeutic agent) (SEQ ID NO: 454); 

Ac-Q-A-R-S-S-L-{therapeutic agent) (SEQ ID NO: 455); 
f Ac-Q-S-R-S-L-(therapeutic agent) (SEQ ID NO: 456); 

Ac-Q-S-R-S-S-nV-(therapeutic agent) (SEQ ID NO: 457); 
10 Ac-Q-S-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 458); 

Ac-Q-S-R-S-S-L-(therapeutic agent) (SEQ ID NO: 459); 

Ac-Q-T-R-S-S-L-(therapeutic agent) (SEQ ID NO: 460); 

Ac-Q-Aib-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 461); 

Ac-Q-Aib -R-S-S-L-(therapeutic agent) (SEQ ID NO: 462); 
15 Ac-Q-Abu-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 463); 

Ac-Q-Abu-R-S-S-L-(therapeutfC agent) (SEQ ID NO: 464); 

Ac-Q-Cha-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 465); 

Ac-Q-F-R-S-L-(therapeutic agent) (SEQ ID NO: 466); 

Ac-Q-F-R-S-S-L-(therapeutic agent) (SEQ ID NO: 467); 
20 Ac-Q-Y-R-S-S-L-{therapeuttc agent) (SEQ ID NO: 468); 

Ac-R-G-R-S-L-(therapeutic agent) (SEQ ID NO: 469); 

Ac-R-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 470); 

Ac-R-G-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 471); 

Ac-R-G-R-S-Cha-(therapeutic agent) (SEQ ID NO: 472); 
25 Ac-R-A-R-S-L-(therapeutic agent) (SEQ ID NO: 473); 

Ac-R-A-R-S-S-L-(therapeutic agent) (SEQ ID NO: 474); 

Ac-R-S-R-S-L-(therapeutic agent) (SEQ ID NO: 475); 

Ac-R-S-R-S-S-L-(therapeuttc agent) (SEQ ID NO: 476); 

Ac-R-S-R-S-Cha-(therapeutic agent) (SEQ ID NO: 477); 



wo 02/095007 



PCT/US02/16819 



-127- 

Ac-R-S-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 478); 
Ac-R-F-R-S-L-(therapeutic agent) (SEQ ID NO: 479); 
Ac-R-F-R-S-Cha-(therapeutic agent) (SEC ID NO: 480); 
Ac-Y-G-R-S-S-L-{therapeutic agent) (SEQ ID NO: 481); 
5 Ac-l\/l(02)-S-R-S-L-(therapeutic agent) (SEQ ID NO: 482); 
Ac-R-R-Q-S-R-A-A-(therapeutic agent) (SEQ ID NO: 105); 
Ac-R-R-Q-S-R-l-(therapeutic agent) (SEQ ID NO: 610); 
Ac-R-R-Q-S-R-S-S-L-{therapeutic agent) (SEQ ID NO: 543); 
Ac-R-R-Q-S-R-S-L-{therapeutlc agent) (SEQ ID NO: 544); 
10 Ac-R-G-S-G-R-S-L-(therapeutic agent) fSEQ ID NO: 545); 
Ac-R-G-S-G-R--S-nL-(therapeutic agent) (SEQ ID NO: 546); 
Ac-R-G-S-G-R-A-nL-(therapeutic agent) (SEQ ID NO: 547); 
Ac-R-G-S-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 548); 
Ac-l-V-S-G-R-A-S-L-(therapeutic agent) (SEQ ID NO: 549); 
15 Ac-R-R-Q-S-R-A-(therapeutic agent) (SEQ ID NO: 108); 
Ac-R-R-Q-S-R-l-(therapeutic agent) (SEQ ID NO: 111); 
Ac-L-R-R-Q-S-R-A-A-(therapeutlc agent) (SEQ ID NO: 106); 
Ac-L-R-R-Q-S-R-G-G-(therapeutic agent) (SEQ ID NO: 109); 
Ac-L-R-R-Q-S-R-A-(therapeutic agent) (SEQ ID NO: 110); 
20 Ac-L-R-R-Q-S-R-A-l-(therapeutic agent) (SEQ ID NO: 112); 
Ac-L-R-R-Q-S-R-A-l-(therapeutic agent) (SEQ ID NO: 61 1); 
Ac-L-R-R-Q-S-R-S-S-L-(therapeutic agent) (SEQ ID NO: 550); and 
Ac-L-R-R-Q-S-R-S-L-(therapeutic agent) (SEQ ID NO: 551); 

In another embodinnent, the conjugates provided herein are 
25 selected from: 

Ac-S-G-R-S-L-(therapeutic agent) (SEQ ID NO: 362); 
Ac-S-G-R-S-S-L-{therapeutic agent) (SEQ ID NO: 363); 
Ac-S-G-R-S-S-S-L-(therapeutic agent) (SEQ ID NO: 364); 
Ac-S-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 365); 
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Ac-S-G-R-S-nV-(therapeutic agent) (SEQ ID NO: 366); isomer 1 

Ac-S-G-R-S-nV-(therapeutic agent) (SEQ ID NO: 367); isomer 2 

Ac-S-G-R-S-G(hex)-(therapeutic agent) (SEQ ID NO: 368); 

Ac-S-G-R-S-Cha-(therapeutic agent) (SEQ ID NO: 369); 
5 Ac-S-G-R-S-hCha-(therapeutic agent) (SEQ ID NO: 370); 

Ac-S-A-R-S-L-{therapeutic agent) (SEQ ID NO: 371); 

Ac-S-A-R-S-S-L-{therapeutiG agent) (SEQ ID NO: 372); 

Ac-S-S-R-S-nL-(therapeutic agent) (SEQ ID NO: 373); 

Ac-T-G-R-S-Abu-(therapeutic agent) (SEQ ID NO: 374); 
10 Ac-T-G-R-S-L-(therapeutic agent) (SEQ ID NO: 375); 

Ac-T-G-R-S-nV-(therapeutic agent) (SEQ ID NO: 376); 

Ac-T-G-R-S-nL-(therapeutic agent) {SEQ ID NO: 377); 

Ac-T-G-R-S-G(hex)-(therapeutic agent) (SEQ ID NO: 378); 

Ac-T-G-R-S-Cha-(therapeutiG agent) (SEQ ID NO: 379); 
15 Ac-T-G-R-S-hCha-(therapeutic agent) (SEQ ID NO: 380); 
Ac-T-G-R-T-Abu-(therapeutic agent) (SEQ ID NO: 381); 
Ac-T-G-R-hS-nL-(therapeutic agent) (SEQ ID NO: 382); 
Ac-T-G-R-Abu-nL-(therapeutic agent) (SEQ ID NO: 383); 
Ac-T-G-R-Abu-nV-(therapeutic agent) {SEQ ID NO: 384); 
20 Ac-T-G-F{Gn)-S-nL-(therapeutlc agent) {SEQ ID NO: 385); 
Ac-T-G-F(Gn)-S-Cha-(therapeutic agent) {SEQ ID NO: 386); 
Ac-T-G-F{Gn)-Abu-nV-{therapeutic agent) (SEQ ID NO: 387); 
Ac-T-G-K(alIoc)-S-nL-(therapeutic agent) (SEQ ID NO: 388); 
Ac-T-G-K-S-nL-(therapeutic agent) (SEQ ID NO: 389); 
25 Ac-T-G-hR-S-nL-(therapeutic agent) (SEQ ID NO: 390); 
Ac-(hS)G-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 391); 
MeOCO-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 392); 
PhS02-T-G-R-S-nL-{therapeutic agent) (SEQ ID NO: 393); 
MeOEtCO-T-G-R-S-nL- (therapeutic agent) (SEQ ID NO: 394); 
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MeO(EtO)2Ac-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 395); 

4-oxo-Pentanoyl-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 396); 

3,4-MethyldioxyPhAc-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 397); 

2-PyridylAc-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 398); 
5 PhOAc-T-G~R-S-nL-{therapeutic agent) (SEQ ID NO: 399); 

L-3-PhLactyl-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 400); 

MeOAc-T-G-R-S-nL- (therapeutic agent) (SEQ ID NO: 401); 

PhAc-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 402); 

MeOEtOCO-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 403); 
10 MeOEtOAc-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 404); 

HOOCButa-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 405); 

Z-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 406); 

EtOCO-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 407); 

;ffA-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 408); 
15 Pent-4-ynoyl-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO; 409); 

NapAc-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 410); 

iBoc-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 411); 

HOAc-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 412); 

MeSucc-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 413); 
20 N,N-diMeGly-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 414); 

Succ-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 415); 

HCO-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 416); 

Ac-T-A-R-S-nL-(therapeutic agent) (SEQ ID NO: 417); 

Ac-T-A-F(Gn)-S-nL-(therapeutlc agent) (SEQ ID NO: 418); 
25 Ac-T-A-R-Abu-nV-(therapeutic agent) (SEQ ID NO: 419); 

Ac-T-A-R-S-Abu-(therapeutic agent) (SEQ ID NO: 420); 

Ac-T-A-R-T-Abu-(therapeutrc agent) (SEQ ID NO: 421); 

Ac-T-S{0-Me)-R-S-nL-(therapeutic agent) (SEQ ID NO: 422); 

Ac-T-hS-R-S-nL-(therapeutic agent) (SEQ ID NO: 423); 
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Ac-T-{1-Me)H-R-S-nL-(therapeutic agent) (SEQ ID NO: 424); 

Ac-T-(3-Me)H-R-S-nL-Uherapeutic agent) (SEQ ID NO: 425); 

Ac-T-H-R-S-nL-(therapeutfC agent) (SEQ ID NO: 426); 

Ac-T-Sar-R-S-nL-(therapeutic agent) (SEQ ID NO: 427); 
5 Ac-T-n\/-R-S-nL-(therapeutic agent) (SEQ ID NO: 428); 

Ac-T-nL-R-S-nL-(therapeutic agent) (SEQ ID NO: 429); 

Ac-T-A-R-S-Cha-(therapeutic agent) (SEQ ID NO: 430); 

Ac-T-Abu-R-S-nL-(therapeutic agent) (SEQ ID NO: 431); 

Ac-4,4diMeThr-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 432); 
10 Ac-hS-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 433); 

Ac-hS-G-R-hS-Cha-(therapeutic agent) (SEQ ID NO: 434); 

Ac-hS-G-R-S-Cha-(therapeutic agent) (SEQ ID NO: 435); 

Ac-hS-G-R-T-Cha-(therapeutic agent) (SEQ ID NO: 436); 

Ac-hS-A-R-S-Cha-(therapeutic agent) (SEQ ID NO: 437); 
15 Ac-N-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 438); 

Ac-Y-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 439); 

Ac-Y-G-R-S-Cha-(therapeutic agent) (SEQ ID NO: 440); 

Ac-Q-G-R-S-S-nL-(therapeutic agent) (SEQ ID NO: 441); 

Ac-Q-G-R-S-S-nV-(therapeutic agent) (SEQ ID NO: 442); 
20 Ac-L-R-G-S-G-R-S-A-(therapeutic agent) (SEQ ID NO: 573); 

Ac-L-R-G-S-G-R-S-L-(therapeutic agent) (SEQ ID NO: 342); 

Ac-L-R-G-S-G-R-S-L-(therapeutic agent) (SEQ ID NO: 343); 

Ac-L-R-G-S-G-R-S-S-nL-(therapeutic agent) (SEQ ID NO: 344); 

Ac-L-R-G-S-G-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 345); 
25 Ac-L-R-G-dS-A-R-S-A-(therapeutic agent) (SEQ ID NO: 574); 

Ac-L-R-G-S-A-R-S-S-L-(therapeutic agent) (SEQ ID N0:346 ); 

Ac-L-R-G-S-A-R-S-L-(therapeutic agent) (SEQ ID NO: 347); 

Ac-L-R-G-S-A-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 348); 
Ac-L-R-G-S-A-R-S-S-nV-(therapeutic agent) (SEQ ID NO: 349); 



wo 02/095007 



PCT/US02/16819 



-131- 

Ac-L-R-G-S-A-R-S-S-nL-{therapeutic agent) (SEQ ID NO: 350); 

Ac-V-l-V-S-G-R-A-L-(therapeutic agent) (SEQ ID NO: 351); 

Ac-V-I-V-S-A-R-S-L-{therapeutic agent) (SEQ ID NO: 352); 

Ac-V-I-V-S-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 353); 
5 Ac-V-l-V-S-A-R-M-A-(therapeutic agent) (SEQ ID NO: 354); 

Ac-V-!-V-S-A-R-nL-A-{therapeutic agent) (SEQ ID NO: 355); 

Ac-V-l-V-S-A-R-S-nL-(therapeutic agent) (SEQ ID NO: 356); 

Ac-V-l-V-S-A-R-S-Cha-(therapeutic agent) (SEQ ID NO: 357); 

Ac-V-l-V-S-A-R-S-Cha-(therapeuttc agent) (SEQ ID NO: 358); 
10 Ac-V-l-V-S-A-R-S-S-Cha-{therapeutic agent) (SEQ ID NO: 359); 

Ac-R-R-(Me)C-P-G-R-V-V-(therapeutic agent) (SEQ ID NO: 360); 

Ac-R-R-nV-P-A-R-S-L-(therapeutic agent) (SEQ NO: 361); 

Ac-R-G-dS-A-R-S-A-(therapeutic agent) (SEQ ID NO: 309); 

Ac-R-G-S-G-R-S-A-(therapeutic agent) (SEQ ID NO: 310); 
15 Ac-R-G-S-G-R-A-L-(therapeutic agent) (SEQ ID "NO: 311); 

Ac-R-G-S-G-R-S-L-(therapeutic agent) (SEQ ID NO: 312); 

Ac-R-G-S-G-R--S-nL-(therapeutic agent) (SEQ ID NO: 313); 
Ac-R-G-S-G-R-A-nL-(therapeutic agent) (SEQ ID NO: 314); 
Ac-R-G-S-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 315); 
20 Ac-R-G-S-G-R-S-Cha-(therapeutic agent) (SEQ ID NO: 316); 
Ac-R-G-S-G-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 317); 
Ac-R-G-S-A-R-S-Cha-{therapeutic agent) (SEQ ID NO: 318); 
Ac-R-G-S-A-R-S-S-(therapeutic agent) (SEQ ID NO: 319); 
Ac-R-G-S-A-R-S-nV-{therapeutic agent) (SEQ ID NO: 320); 
25 Ac-R-G-S-A-R-S-S-nV -(therapeutic agent) (SEQ ID NO: 321); 
Ac-R-G-S-A-R-S-L-(therapeutic agent) (SEQ ID NO: 322); 
Ac-R-(Me)C-P-G-R-V-V-(therapeutlc agent) (SEQ ID NO: 323); 
Ac-R-(Me)C-P-G-R-V-V-(therapeutlc agent) (SEQ ID NO: 324); 
Ac-R-C(l\/Ie)-P-G-R-S-L-(therapeutic agent) (SEQ ID NO: 325); 
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Ac-R-L-P-G-R-S-L-(therapeutic agent) (SEQ ID NO: 326); 

Ac-R-V-P-G-R-S-L-(therapeutic agent) {SEQ ID NO: 327); 

Ac-R-V-P-G-R-S-L-(therapeutic agent) (SEQ (D NO: 328); 

Ac-R-nL-P-G-R-S-L-<therapeutic agent) (SEQ ID NO: 329); 
5 Ac-R-G(tBu)-P-A-R-S-L-(therapeutic agent) (SEQ ID NO: 330); 

Ac-R-L-P-A-R-S-L-(therapeutic agent) (SEQ ID NO: 331); 

Ac-R-V-P-A-R-S-L-(therapeutic agent) (SEQ ID NO: 332); 

Ac-R-nL-P-A-R-S-L-(therapeutic agent) (SEQ ID NO: 333); 

Ac-l-V-S-G-R-A-L-(therapeutic agent) (SEQ ID NO: 334); 
10 Ac-l-V-S-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 335); 

Ac-l-V-S-G-R-A-S-L-(therapeutic agent) (SEQ ID NO: 336); 

Ac-l-V-S-A-R-M-A-(therapeutic agent) (SEQ ID NO: 337); 

Ac-l-V-S-A-R-nL-A-(therapeutic agent) (SEQ ID NO: 338); 

Ac-l-V-S-A-R-S-L-(therapeutic agent) (SEQ ID NO: 339); 
15 Ac-l-V-S-A-R-S-nL-(therapeutic agent) (SEQ ID NO: 340); 

Ac-l-V-S-A-R-S-S-L-(therapeutic agent) (SEQ ID NO: 341); 

Ac-G-S-G-R-S-A-(therapeutic agent) (SEQ ID NO: 585); 

Ac-G-S-G-R-S-L-(therapeutic agent) (SEQ ID NO: 277); 

Ac-G-S-G-R-A-L-(therapeutlc agent) (SEQ ID NO: 278); 
20 Ac-G-S-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 279); 

Ac-G-S-G-R-L-(th6rapeutic agent) (SEQ ID NO: 280); 

Ac-G-S-G-(4-guan)Phg-S-L-(therapeutic agent) (SEQ ID NO: 281); 

Ac-G-S-G-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 282); 

Ac-G-S-G-R-A-S-L-(therapeutic agent) (SEQ ID NO: 283); 
25 Ac-G-S-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 284); 

Ac-G-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 285); 

Succ-bA-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 286); 

Ac-G-T-G-R-S-hCha-(therapeutic agent) (SEQ ID NO: 287); 

Ac-G-hS-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 288); 
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Ac-G-dS-A-R-S-A-(therapeutic agent) (SEQ ID NO: 289); 

Ac-G-S-A-R-S-L-(therapeutic agent) (SEQ ID NO: 290); 

Ac-G-S-A-R-S-S-Cha-{therapeutic agent) (SEQ ID NO: 291); 

Ac-G-S-A-R-S-S-L-(therapeutic agent) {SEQ ID NO: 292); 
5 Ac-G-S-A-R-A-S-L-{therapeutic agent) (SEQ ID NO: 293); 

Ac-V-S-G-R-S-L-(therapeutic agent) (SEQ ID NO: 294); 

Ac-V-S-G-R-A-L-(therapeutic agent) (SEQ ID NO: 295); 

Ac-V-S-G-R-A-S-L-(therapeutic agent) (SEQ ID NO: 296); 

Ac-V-S-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 297); 
10 Ac-V-S-A-R-M-A-(therapeutic agent) (SEQ ID NO: 298); 

Ac-V-S-A-R-nL-A-(therapeutic agent) (SEQ ID NO: 299); 

Ac-V-S-A-R-S-nL-(therapeutic agent) (SEQ ID NO: 300); 

Ac-V-S-A-R-S-L-(therapeutic agent) (SEQ ID NO: 301); 

Ac-(Me)C-P-G-R-V-V-(therapeutic agent) (SEQ ID NO: 302); 
15 Ac-(Me)C-P-G-R-V-V-(therapeutic agent) (SEQ ID NO: 303); 

Ac-C{Me)-P-G-R-A-L-(therapeutlc agent) (SEQ ID NO: 304); 

Ac-C{Me)-P-G-R-S-L-(therapeutic agent) (SEQ ID NO: 305); 

Ac-C{Me)-P-A-R-S-L-(therapeutic agent) (SEQ ID NO: 306); 

Ac-C(Me)-P-A-R-A-S-L-(therapeutic agent) (SEQ ID NO: 307);and 
20 Ac-G(tBu)-P-G-R-S-L-(therapeutic agent) (SEQ ID NO: 308); 

In another embodiment, the conjugates provided herein are 

selected from: 

Ac-Q-S-R-A-A-(therapeutic agent) (SEQ ID NO: 552); 
Ac-Q-S-R-S-A-(therapeutic agent) (SEQ ID NO: 553); 
25 Ac-Q-S-R-S-G-(therapeutic agent) (SEQ ID NO: 554); 
Ac-R-S-R-A-A-(therapeutic agent) (SEQ ID NO: 555); 
Ac-R-Q-S-R-A-A-(therapeutic agent) (SEQ ID NO: 556); 
Ac-R-Q-S-R-S-A-(therapeutic agent) (SEQ ID NO: 557); and 
Ac-R-Q-S-R-S-A-A-(therapeutic agent) (SEQ ID NO: 558); 
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In another embodiment, the conjugates provided herein are 
selected from: 

Ac-R-G-S-G-R-S-A-(therapeutic agent) {SEQ ID NO: 559); 
Ac-S-G-R-A-A-(therapeutic agent) (SEQ ID NO: 560); 
5 Ac-S-G-R-S-A-{therapeutic agent) (SEQ ID NO: 561); 
Ac-S-G-R-S-S-A-(therapeutic agent) (SEQ ID NO: 562); 
Ac-S-G-R-A-S-A-(therapeutic agent) (SEQ ID NO: 563); 
Ac-S-G-R-S-G-(therapeuttc agent) (SEQ ID NO: 564); 
Ac-S-G-R-S-S-G-(therapeutic agent) (SEQ ID NO: 565); 
10 Ac-S-G-R-S-G-A-(therapeutic agent) (SEQ ID NO: 566); 

Ac-S-G-R-S-G-G-(therapeutic agent) (SEQ ID NO: 567); and 
Ac-G-T-G-R-S-G-G-(therapeutic agent) (SEQ ID NO: 568); 

In another embodiment, the conjugates provided herein are 
selected from: 

15 Ac-L-R-R-Q-S-R-A-A-(therapeutic agent) (SEQ ID NO: 597); 

MeS02-dA(Chx)-Abu-R-S-L-(therapeutic agent) (SEQ ID NO: 598); 

Ac-R-A-R-S-L-(therapeutic agent) (SEQ ID NO: 599); 

Ac-dA(Chx)-Abu-R-S-L-(therapeutic agent) (SEQ ID NO: 600); 

Ac-dA(Chx)-Abu-R-S-S-L-(therapeutic agent) (SEQ ID NO: 601); 
20 Ac-Q-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 602); 

MeOCO-dhF-P(OH)-R-S-S-L-(therapeutic agent) (SEQ ID NO: 603); 

MeOCO-Quat4-G-R-S-L-(therapeutic agent) (SEQ ID NO: 604); 

Ac-dCha-P(OH)-R-S-S-L-(therapeutic agent) (SEQ ID NO: 605); 

Ac-dCha-Abu-R-S-S-A-(therapeutic agent) (SEQ ID NO: 606); 
25 iyieOCO-Quat2-G-R-S-L-(therapeutic agent) (SEQ ID NO: 607); 

MeOCO-Quat3-G-R-S-L-{therap6utic agent) (SEQ ID NO: 608); and 

MeOCO-Quat-G-R-S-L-(therapeutic agent) (SEQ ID NO: 609), 

It also understood that conjugates containing the above peptidic 

substrate portions can be prepared with other capping groups in place of 
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Ac (see, e.g., the description herein of the capping groups X"). 
Therapeutic agents for use in the above conjugates include, for example, 
cytotoxic agents, such as, but not limited to, a toxin such as abrin, 
ricin A, pseudomonas exotoxin shiga toxin, diphtheria toxin and other 
5 such toxins and toxic portions thereof; proteins such as tumor necrosis 
factor, interferons, such as a-interferon and gamma-rnterferon, pro- 
coagulants such as tissue factor and tissue factor variants, pro-apoptotic 
agents such FAS-ligand, nerve growth factor, platelet derived growth 
factor, tissue plasminogen activator; biological response modifiers such 
10 as, for example, lymphokines, interleukin- I {IL-I), interleukin-2 (IL-2), 
interleukin-6 (IL-6), granulocyte macrophage colony stimulating factor 
(GMCSF), granulocyte colony stimulating factor (G-CSF), fibroblast 
growth factors (FGFs) and other growth factors, the methotrexate group 
of drugs, the anthracycline family of drugs, the vinca alkaloid drugs, the 
15 mrtomycins, the bleomycins, the cytotoxic nucleosides Including cytosine 
arabinosides and difluoronucleosides, the pteridine family of drugs, 
diynenes, the taxanes and the podophyllotoxins. All such conjugates are 
within the scope of the Instant disclosure and can be prepared and used 
as disclosed herein. 
20 Thus, the conjugates provided herein Include, but are not limited 

to, those disclosed herein where the therapeutic agent is, e.g., 
doxorubicin, carminomycin, daunorubicin, detorubicin, Idarubicin, 
eplrubicin, esorubicin, THP, AD-32, aminopterin, methotrexate, 
methopterin, dichloromethotrexate, mitomycin C, porfiromycin, 
25 5-fluorouraciI, 6-mercaptopurlne, cytosine arabinoslde, podophyliotoxin, 
or podophyliotoxin derivatives such as etoposlde or etoposide phosphate, 
melphalan, vinblastine, vincristine, leurosidine, vindesine, leurosine, taxol, 
estramustlne, cisplatin, combretastatin and analogs, and 
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cyclophosphamide. In one embodiment, the therapeutic agent is 
doxorubicin. In another embodiment, the therapeutic agent is taxol. 

Any conjugates corresponding to the above conjugates or any 
conjugates disclosed herein where the PI ' and/or P2' residues are lie in 
5 place of Ala are within the scope of the instant disclosure and can be 
prepared and used as disclosed herein. 

Any peptidic substrates formed by permutation and selection of 
amino acids from those set forth in the above definitions of P groups are 
contemplated. 
10 D. Preparation of the Conjugates 

The peptidic substrates of the conjugates provided herein are 
synthesized from their constituent amino acids by conventional peptide 
synthesis techniques, such as by solid-phase technology. The peptidic 
substrates are then purified by reverse-phase high performance liquid 
15 chromatography (HPLC). 

The peptide acids can be prepared from their constituent Fmoc- 
aminoacids. Standard methods of peptide synthesis are disclosed, for 
example, in the following works: Synthesis Notes Section, NovaBiochem 
Catalog 2002/3, Schroeder a/., "The Peptides", Vol. 1, Academic 
20 Press 1965; Bodansky etal., "Peptide Synthesis", Interscience 
Publishers, 1966; McOmie (ed.) "Protective Groups in Organic 
Chemistry", Plenum Press, 1973, Barany ef a!., 'The Peptides: Analysis, 
Synthesis, Biology" 2, Chapter 1, Academic Press, 1990, and Stewart et 
al., "Solid Phase Peptide Synthesis", Second Edition, Pierce Chemical 
25 Company, 1 994. The disclosures of these references are hereby 
incorporated by reference. 

The pharmaceutically acceptable salts of the conjugates provided 
herein Include the conventional non-toxic salts of the conjugates as 
formed, e,g,, from non-toxic inorganic or organic acids. For example, 
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such conventional non-toxic salts include those derived from inorganic 
acids such as hydrochloric, hydrobromic, sulfuric, sulfamic, phosphoric, 
nitric and the like: and the salts prepared from organic acids such as 
acetic, propionic, succinic, glycoiic, stearic, lactic, malic, tartaric, citric, 
5 ascorbic, pamoic, maleic, hydroxymaleic, phenyl-acetic, glutamic, 

benzoic, salicylic, sulfanilic, 2-acetoxy-benzoic, fumaric, toluenesulfonic, 
methanesulfonic, ethane disulfonrc, oxalic, isethionic, trifluoroacetic and 
the like. 

The conjugates provided herein that contain the peptidic moieties 
10 containing the cell surface protease cleavage site and a therapeutic agent 
can similarly be synthesized by techniques known to those of skill in the 
art. For example, a free amine moiety on the therapeutic agent can be 
covalently attached to the peptidic substrate at the carboxyl terminus 
such that an amide bond is formed- Similarly, an amide bond can be 
15 formed by covalently coupling an amine moiety of the peptidic substrate 
and a carboxyl moiety of the therapeutic agent. For these purposes a 
reagent such as 2-(1H-benzotriazol-1-yl)-1 ,3,3-tetramethyl-uronium 
hexafluorophosphate (known as HBTU) and 1 -hyroxybenzotriazole 
hydrate (known as HOBT), dicyclohexyl-carbodiimide (DCC), 
20 N-ethyl-N-(3-dimethylaminopropyl)- carbodiimide (EDO, diphenyl- 
phosphorylazide (DPPA), benzotriazol-1 -yl-oxy-tris-(dimethylamino)- 
phosphonium hexafluorophosphate (BOP) and the like, used in 
combination or singularly, can be utilized. 

Furthermore, the instant conjugates can be formed by a non- 
25 peptidyl bond between the cell surface protease cleavage site and a 

therapeutic agent. For example, the therapeutic agent can be covalently 
attached to the carboxyl terminus of the peptidic substrate via a hydroxyl 
moiety on the therapeutic agent, thereby forming an ester linkage. For 
this purpose a reagent such as a combination of HBTU and HOBT, a 
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combination of BOP and imidazoie, a combination of DCC and DMAP, and 
the like can be utilized. The carboxylic acid also can be activated by 
forming the nitro-phenyl ester or the like and reacted in the presence of 
DBU (1 ,8-dia2abicyclo[5,4,0]undec-7-ene). 
5 The instant conjugates also can be formed by attachment of the 

peptidic substrate to the therapeutic agent via a linker unit. Such linker 
units include, for example, a biscarbonyl alkyi diradical whereby an amine 
moiety on the therapeutic agent is connected with the linker unit to form 
an amide bond and the amino terminus of the peptidic substrate is 
10 connected with the other end of the linker unit also forming an amide 

bond. Conversely, a diaminoalkyi diradical linker unit, whereby a carbonyl 
moiety on the cytotoxic agent is covalently attached to one of the amines 
of the linker unit while the other amine of the linker unit is covalently 
attached to the C-terminus of the peptidic substrate, also can be useful. 
15 Other such linker units which are stable to the physiological environment 
when not in the presence of a cell surface protease, or a soluble, shed or 
released form thereof, but are cleavable upon the cleavage of the cell 
surface protease proteolytic cleavage site, or a soluble, shed or released 
form thereof, are also envisioned. Furthermore, linker units can be utilized 
20 that, upon cleavage of the cell surface protease proteolytic cleavage site, 
remain attached to the therapeutic agent but do not significantly decrease 
the therapeutic activity of such a post-cleavage therapeutic agent 
derivative when compared with an unmodified therapeutic agent. 

One skilled in the art understands that in the synthesis of the 
25 conjugates provided herein, one can need to protect various reactive 
functionalities on the starting compounds and intermediates while a 
desired reaction is carried out on other portions of the molecule. After the 
desired reactions are complete, or at any desired time, normally such 
protecting groups will be removed by, for example, hydrolytic or 
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hydrogenolytic means. Such protection and deprotection steps are 
conventional in organic chemistry. One skilled in the art is referred to 
Protective Groups in Organic Chemistry, McOmie, ed.. Plenum Press, NY, 
NY (1973); and. Protective Groups in Organic Synthesis, Greene, ed., 
5 John Wiley & Sons, NY, NY (1991) for the teaching of protective groups 
which can be useful in the preparation of the conjugates provided herein. 

By way of example only, useful amino-protecting groups can 
include, for example, Ci-Ciq alkanoyi groups such as formyl, acetyl, 
dichloroacetyl, propionyl, hexanoyi, 3,3-diethylhexanoyl, K-chlorobutryl, 
10 and the like; C^-C^q alkoxycarbonyl and Cg-C^g aryloxycarbonyl groups 
such as tert-butoxycarbonyl, benzyloxycarbonyl, allyloxycarbonyl, 
4-nitrobenzyloxycarbonyl, fluorenylmethyloxycarbonyl and cinnamoyloxy- 
carbonyl; halo(C,-Cio)-alkoxycarbonyl such as 2,2,2-trichloroethoxy- 
carbonyl; and C,-Ci5 arylalkyi and alkenyl group such as benzyl, 
15 phenethyl, allyl, trityl, and the like. Other commonly used amino- 
protecting groups are those in the form of enamines prepared with 
jff-keto-esters such as methyl or ethyl acetoacetate. 

Useful carboxy-protecting groups can Include, for example, C^-C^q 
alky! groups such as methyl, tert-butyl, decyl; halo Ci-C,o alkyi such as 
20 2,2,2-trichloroethyl, and 2-iodoethyl; Cg-C^g arylalkyi such as benzyl, 

4-methoxybenzyI, 4-nitro benzyl, triphenylmethyl, diphenyl-methyl; C,-Cio 
alkanoyloxymethyl such as acetoxy-methyl, propionoxymethyl and the 
like; and groups such as phenacyl, 4-halophenacyi, allyl, dimethylallyl, 
tri-(CTC3 alkyDsilyl, such as trimethylsilyl, ^-p-toluenesulfonylethyl, 
25 ;ff-p-nitrophenyl-thioethyI, 2,4,6-trimethyibenzyl, )ff-methylthioethyl, 
phthalimidomethyl, 2,4-dinitro-phenylsulphenyl, 2-nitrobenzhydryl and 
related groups. 

Similarly, useful hydroxy protecting groups can include, for 
example, the formyi group, the chloroacetyl group, the benzyl group, the 
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benzhydryl group, the trityl group, the 4-nitrobenzyl group, the 
trimethylsilyl group, the phenacyl group, the tert-butyl group, the 
methoxymethyl group, the tetrahydropyranyl group, the tert-butyl- 
dimethylsily! group and the like. 
5 With respect to the embodiment of a peptidic substrate combined 

with the anthracycline antibiotic doxorubicin, the following Reaction 
Schemes illustrate the synthesis of the conjugates provided herein. 
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REACTION SCHEME I 




OMe O OH O 
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REACTION SCHEME lit 




OMe O OH O 
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REACTION SCHEME IV 
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REACTION SCHEME V 
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Reaction Scheme VI illustrates preparation of the conjugates 
provided herein of a peptidic substrate and the vinca alkaloid cytotoxic 
agent vinblastine wherein the attachment of vinblastine is at the 
C-terminus of the peptidic substrate. The use of the 1 ,3-diaminopropane 
5 linker is illustrative only; other linker units between the carbonyl of 
vinblastine and the C-terminus of the peptidic substrate are also 
envisioned (e.g., {CH2)^,'T-{CH2)J. The acyl azide starting material is 
prepared from vinglasine by reaction with hydrazine (60-65 MeOH), 
followed by reaction with HCI/DMF/isoamyl nitrite. Furthermore, Reaction 

10 Scheme VI illustrates a synthesis of conjugates wherein the C4-hydroxy 
moiety is reacetylated following the addition of the linker unit. It is known 
that the desacetyl vinblastine conjugate also is efficacious and can be 
prepared by eliminating the steps shown in Reaction Scheme VI of 
protecting the primary amine of the linker and reacting the intermediate 

15 with acetic anhydride, followed by deprotection of the amine (see, e.g., 
International Patent Application Publication No. WO 98/10651). 
Conjugation of the peptidic substrate at other positions and functional 
groups of vinblastine can be readily accomplished by one of ordinary skill 
in the art and also is expected to provide conjugates that are substrates 

20 for cell surface proteases, or a soluble, shed or released form thereof. 



I 
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REACTION SCHEME VI 
OH 




Me 



NH-pepUde-X" 
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Reaction Scheme VII illustrates preparation of certain of the 
conjugates utilized in the compositions and methods provided herein 
wherein the peptidic substrates are combined with the vinca alkaloid 
cytotoxic agent vinblastine. Attachment of the N-terminus of the peptidic 
5 substrate to vinblastine is illustrated (S.P. Kandukuri era/. (1985) J. Med. 
Chem, 25:1079-1088). 

It also is understood that conjugates can be prepared wherein the 
N-terminus of the peptidic substrate utilized in the compositions and 
methods provided herein is combined with one therapeutic agent, such as 

10 a cytotoxic agent, such as vinblastine, while the C-terminus is 

simultaneously attached to another cytotoxic agent, which is the same or 
different cytotoxic agent, such as doxorubicin. Reaction Scheme Vlll 
illustrates the synthesis of such a polycytotoxic agent conjugate. Such a 
polycytotoxic conjugate can offer advantages over a conjugate containing 

15 only one cytotoxic agent. 
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REACTION SCHEME VII 
OH 




N^H,. reflux, MeOH 



Me ^ OH 
vinblastine CO^CHj 




HONO 



Me - OH 



CONHNHj 




Me - OH 



CON3 
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REACTION SCHEME VUl 
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OH 
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2. ACjO, pyriding 
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REACTION SCHEME VFH (Continued) 
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With respect to the embodiment of a peptidic substrate combined 
with desacetylvtnblastine, the following Reaction Schemes IX and X 
illustrate the synthesis of the conjugates provided herein. 

Reaction Scheme IX illustrates preparation of conjugates provided 
5 herein containing the peptidic substrates provided herein and the vinca 
alkaloid cytotoxic agent vinblastine wherein the attachment of the oxygen 
of the 4-desacetylvinblastine is at the C-terminus of the peptidic 
substrate. While other sequences of reactions can be useful in forming 
such conjugates, it is known that initial attachment of a single amino acid 

10 to the 4-oxygen and subsequent attachment of the remaining peptidic 
substrate sequence to that amino acid is an exemplarary method (see. 
International Patent Application Publication No. WO 99/28345). It also is 
known that 3,4-dihydro-3-hydroxy-4-oxo-1 ,2,3-ben2otriazine (ODHBT) 
can be utilized in place of HO At in the final coupling step. 

15 Reaction Scheme X illustrates preparation of conjugates of the 

peptidic substrates provided herein wherein a hydroxy alkanoyi acid is 
used as a linker between the vinca drug and the peptidic substrate. 
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REACTION SCHEME IX 
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REACTION SCHEME IX (Continued) 
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REACTION SCHEME X 



N-protected amino acid . 
+ 



DMAP/DCC 



HO-(CH,)J(CH,VCO,benzyl 



N-protected amino acid-0-(CH,)^TCCHjX-COjbenzyl 



OH 




hydrogen ation 



1. N-protected amino acid-O-(CHj)^T(CH,)^-C0jH 
DMAP/DCC 



2. deprotect 



CH3O 



Me - OH 
COjMe 
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REACTION SCHEME X (Continued) 

OH 





X'-peptide-ainino acid-0-(CH2) J(CH,)^ 
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Taxol conjugates provided herein nnay be prepared by the general 
method provided below. The preparation of 7-Ala-Taxol and 7-Giy-Taxol 
is disclosed in Mathew et af. (1992) J. Med. Chem. 35:145-151. 

O 




Ac-Arg-Gln-Ser-Arg-Ala-OH 

HATU 

2,6-lutidine 
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E. Formulation and administration of pharmaceutical compositions 

The conjugates and compositions provided herein are used for 
treating, preventing, or ameliorating one or more symptoms of any 
disease or disorder that can be treated by targeting a cell or tissue that 
5 expresses a cell surface protease, particularly, a serine protease, on its 
surface at higher levels compared to other cells, or soluble, shed or 
released forms thereof. These include, but are not limited to, 
hyperproliferative diseases, such as cancer, any disease associated with 
aberrant or excessive angiogenesis, autoimmune disorders, inflammatory 
10 diseases and any other disease for which an appropriate ceil surface 
protease, including cell-associated and cell-localized proteases, can be 
identified. 

The pharmaceutical compositions provided herein contain 
therapeutically effective amounts of one or more of the conjugates 

15 provided herein that are useful in the prevention, treatment, or 

amelioration of one or more of the symptoms of diseases or disorders 
associated with undesired and/or uncontrolled angiogenesis or 
neovascularization. Such diseases or disorders include, but are not 
limited to, solid neoplasms, including lung, colon, esophageal, breast, 

20 ovarian and prostate cancers; vascular malformations and cardiovascular 
disorders, including, but not limited to, angiofibroma, angiolipoma, 
atherosclerosis, restenosis/reperfusion injury, arteriovenous 
malformations, hemangiomatosis and vascular adhesions, 
dyschondroplasia with vascular hematomas, hereditary hemorrhagic 

25 telangiectasia and Von Hippie Lindau syndrome; chronic inflammatory 
diseases and abherent wound repairs, including, but not limited to, 
diabetes meflitus, hemophiliac joints, inflammatory bowel disease, 
nonhealing fractures, rapidly progressing periodontitis, juvenile 
periodontitis, psoriasis, rheumatoid arthritis, venous stasis ulcers. 
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granulations-burns, hypertrophic scars, liver cirrhosis, osteoradionecrosis, 
postoperative adhesions, pyogenic granuloma and systemic sclerosis; 
circulatory disorders, including, but not limited to, Raynaud's 
phenomenon; crest syndromes, including, but not limited to, calcinosis, 
5 esophageal, dyomotiloty, sclerodactyly and teangiectasis; dermatological 
disorders, including, but not limited to, systemic vasculitis, scleroderma, 
pyoderma gangrenosum, vascuiopathy, venous, arterial ulcers, Sturge- 
Weber syndrome. Port-wine stains, blue rubber bieb nevus syndrome, 
Klippel-Trenaunay-Weber syndrome and Osler-Weber-Rendu syndrome; 
10 and ocular disorders, including, but not limited to, blindness caused by 
ocular neovascular disease, corneal graft neovascularization, macular 
degeneration in the eye, neovascular glaucoma, trachoma, diabetic 
retinopathy, myopic degeneration, retinopathy of prematurity, retrolental 
fibroplasia and corneal neovascularization. 
IB The compositions contain one or more conjugates provided herein. 

The conjugates can be formulated into suitable pharmaceutical 
preparations such as, for example, solutions, suspensions, tablets, 
dispersible tablets, pills, capsules, powders, sustained release 
formulations or elixirs, for oral administration or in sterile solutions or 
20 suspensions for parenteral administration, as well as transdermal patch 
preparation and dry powder inhalers. Typically the cojugates described 
above are formulated into pharmaceutical compositions using techniques 
and procedures well known in the art (see, e.g., Ansel (1985) 
introduction to Pharmaceutical Dosage Forms. Fourth Edition, p, 126)). 
25 Effective concentrations can be empirically determined using animal 
models, in vitro models or test subjects. 

In the compositions, effective concentrations of one or more 
conjugates or pharmaceutically acceptable derivatives thereof is (are) 
mixed with a suitable pharmaceutical carrier or vehicle. The conjugates 
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can be derivatized as the corresponding salts, esters, enol ethers or 
esters, acids, bases, solvates or hydrates prior to formulation, as 
described above. The concentrations of the conjugates in the 
compositions are effective for delivery of an amount, upon administration, 
5 that treats, prevents, or ameliorates one or more of the symptoms of 
diseases or disorders associated with undesired and/or uncontrolled 
angiogenesis or neovascularization. Such diseases or disorders include, 
but are not limited to, solid neoplasms; vascular malformations and 
cardiovascular disorders, including, but not limited to, angiofibroma, 

10 angiolipoma, atherosclerosis, restenosis/reperfusion injury, arteriovenous 
malformations, hemangiomatosis and vascular adhesions, 
dyschondroplasia with vascular hamartomas, hereditary hemorrhagic 
telangiectasia and Von Hippie Lindau syndrome; chronic inflammatory 
diseases and abherent wound repairs, including, but not limited to, 

15 diabetes mellitus, hemophiliac joints, inflammatory bowel disease, 
nonhealing fractures, rapidly progressing periodontitis, juvenile 
periodontitis, psoriasis, rheumatoid arthritis, venous stasis ulcers, 
granulations-burns, hypertrophic scars, liver cirrhosis, osteoradionecrosis, 
postoperative adhesions, pyogenic granuloma and systemic sclerosis; 

20 circulatory disorders, including, but not limited to, Raynaud's 

phenomenon; crest syndromes, including, but not limited to, calcinosis, 
esophageal, dyomotiloty, sclerodactyly and teangiectasis; dermatological 
disorders, including, but not limited to, systemic vasculitis, scleroderma, 
pyoderma gangrenosum, vasculopathy, venous, arterial ulcers, Sturge- 

25 Weber syndrome. Port-wine stains, blue rubber bleb nevus syndrome, 
Klippel-Trenaunay-Weber syndrome and Osier- Weber-Rend u syndrome; 
and ocular disorders, including, but not limited to, blindness caused by 
ocular neovascular disease, corneal graft neovascularization, macular 
degeneration in the eye, neovascular glaucoma, trachoma, diabetic 
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retinopathy, myopic degeneration, retinopathy of prematurity, retrolental 
fibroplasia and corneal neovascularization. 

The conjugates herein can be formulated into pharmaceutical 
compositions suitable for topical, local, intravenous and systemic 
5 application. Effective concentrations of one or more of the conjugates are 
mixed with a suitable pharmaceutical carrier or vehicle. The 
concentrations or amounts of the conjugates that are effective requires 
delivery of an amount, upon administration, that ameliorates the 
symptoms or treats the disease. Typically, the compositions are 

10 formulated for single dosage administration. Therapeutically effective 

concentrations and amounts can be determined empirically by testing the 
conjugates in known in vitro and in vivo systems, such as those 
described here; dosages for humans or other animals can then be 
extrapolated therefrom. 

15 Upon mixing or addition of the conjugate{s) with the vehicle, the 

resulting mixture can be a solution, suspension, emulsion or other such 
composition. The form of the resulting mixture depends upon a number 
of factors, including the intended mode of administration and the 
solubility of the conjugate in the selected carrier or vehicle. The effective 

20 concentration is sufficient for ameliorating the symptoms of the disease, 
disorder or condition treated and can be empirically determined based 
upon in vitro and/or in vivo data, such as the data from the mouse 
xenograft model for tumors or rabbit ophthalmic model. If necessary, 
pharmaceutically acceptable salts or other derivatives of the conjugates 

25 can be prepared. 

Pharmaceutical carriers or vehicles suitable for administration of the 
conjugates provided herein include any such carriers known to those 
skilled in the art to be suitable for the particular mode of administration. 
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In addition, the conjugates can be formulated as the sole 
pharmaceutically active ingredient in the composition or can be combined 
with other active ingredients. 

The conjugates can be administered by any appropriate route, for 
example, orally, parenterally, intravenously, intradermally, 
subcutaneously, or topically, in liquid, semi-liquid or solid form and are 
formulated in a manner suitable for each route of administration. 
Exemplary modes of administration depend upon the indication treated. 
Dermatological and ophthalmologic indications will typically be treated 
locally; whereas, tumors and vascular proliferative disorders, will typically 
be treated by systemic, intradermal or intramuscular, modes of 
administration. 

The conjugate is included in the pharmaceutically acceptable carrier 
in an amount sufficient to exert a therapeutically useful effect in the 
absence of undesirable side effects on the patient treated. It is 
understood that number and degree of side effects depends upon the 
condition for which the conjugates are administered. For example, 
certain toxic and undesirable side effects are tolerated when treating life- 
threatening illnesses, such as tumors, that would not be tolerated when 
treating disorders of lesser consequence. 

The concentration of conjugate in the composition will depend on 
absorption, inactivation and excretion rates thereof, the dosage schedule, 
and amount administered as well as other factors known to those of skill 
in the art. 

Typically a therapeutically effective dosage should produce a serum 
concentration of active ingredient of from about 0.1 ng/ml to about 50- 
100 jjg/m\. The pharmaceutical compositions typically should provide a 
dosage of from about 0.01 mg to about 100 - 2000 mg of conjugate, 
depending upon the conjugate selected as adjusted for body surface area 
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and/or weight. Typically, for intravenous or systemic treatment a daily 
dosage of about between 0.05 and 0.5 mg/kg should be sufficient. Local 
application for ophthalmic disorders should provide about 1 ng up to 100 
fJQ, generally about 1 fjq to about 10 //g, per single dosage 
5 administration. It is understood that the amount to administer is a 
function of the conjugate selected, the indication treated, and possibly 
the side effects that will be tolerated. Dosages can be empirically 
determined using recognized models for each disorder. 

Typically, the compositions are formulated for single dosage 

10 administration. To formulate a composition, the weight fraction of 
conjugate Is dissolved, suspended, dispersed or otherwise mixed in a 
selected vehicle at an effective concentration such that the treated 
condition is relieved or ameliorated. Pharmaceutical carriers or vehicles 
suitable for administration of the conjugates provided herein include any 

15 such carriers known to those skilled in the art to be suitable for the 
particular mode of administration. 

In addition, the conjugates can be formulated as the sole ingredient 
in the composition or can be combined with other active ingredients. 
Liposomal suspensions, including tissue-targeted liposomes, particularly 

20 tumor-targeted liposomes, also can be suitable as pharmaceutically 

acceptable carriers. These can be prepared according to methods known 
to those skilled in the art. For example, liposome formulations can be 
prepared as described in U.S. Patent No. 4,522,81 1. Briefly, liposomes 
such as multilamellar vesicles (MLV's) can be formed by drying down egg 

25 phosphatidyl choline and brain phosphatidyl serine (7:3 molar ratio) on 
the inside of a flask. A solution of a conjugate provided herein in 
phosphate buffered saline lacking divalent cations fPBS) is added and the 
flask shaken until the lipid film is dispersed. The resulting vesicles are 
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washed to remove unencapsulated conjugate, pelleted by centrifugation, 
and then resuspended in PBS. 

The conjugate is included in the pharmaceutically acceptable carrier 
in an amount sufficient to exert a therapeutically useful effect in the 
5 absence of undesirable side effects on the patient treated. The 

therapeutically effective concentration can be determined empirically by 
testing the conjugates in in vitro and in vivo systems described herein 
(see, e.g., EXAMPLES 3 and 4) and then extrapolated therefrom for 
dosages for humans. 

10 The concentration of conjugate in the pharmaceutical composition 

will depend on absorption, inactivation and excretion rates of the 
conjugate, the physicochemical characteristics of the conjugate, the 
dosage schedule, and amount administered as well as other factors 
known to those of skill in the art. For example, the amount that is 

15 delivered is sufficient to ameliorate one or more of the symptoms of 
diseases or disorders associated with undesired and/or uncontrolled 
angiogenesis or neovascularization, as described herein. 

Typically a therapeutically effective dosage should produce a serum 
concentration of active ingredient of from about 0.1 ng/ml to about 50- 

20 100 A/g/ml. The pharmaceutical compositions typically should provide a 
dosage of from about 0.001 mg to about 2000 mg of conjugate per kilo- 
gram of body weight per day. Pharmaceutical dosage unit forms are 
prepared to provide from about 1 mg to about 1 000 mg and generally 
from about 10 to about 500 mg of the essential active Ingredient or a 

25 combination of essential ingredients per dosage unit form. 

The conjugate can be administered at once, or can be divided into 
a number of smaller doses to be administered at intervals of time. It is 
understood that the precise dosage and duration of treatment is a 
function of the disease being treated and can be determined empirically 
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using known testing protocols or by extrapolation from in vivo or in vitro 
test data. It is to be noted that concentrations and dosage values can 
also vary with the severity of the condition to be alleviated. It is to be 
further understood that for any particular subject, specific dosage 
5 regimens should be adjusted over time according to the individual need 
and the professional judgment of the person administering or supervising 
the administration of the compositions, and that the concentration ranges 
set forth herein are exemplary only and are not intended to limit the 
scope or practice of the claimed compositions. 

10 Exemplary pharmaceutically acceptable derivatives include acids, 

bases, enol ethers and esters, salts, esters, hydrates, solvates and 
conjugate forms. The derivative is selected such that its pharmacokinetic 
properties are superior to the corresponding neutral conjugate. 

Thus, effective concentrations or amounts of one or more of the 

15 conjugates described herein or pharmaceutically acceptable derivatives 
thereof are mixed with a suitable pharmaceutical carrier or vehicle for 
systemic, topical or local administration to form pharmaceutical 
compositions. Conjugates are included in an amount effective for 
ameliorating one or more symptoms of, or for treating or preventing 

20 diseases or disorders associated with undesired and/or uncontrolled 
angiogenesis or neovascularization, as described herein. The 
concentration of conjugate in the composition will depend on absorption, 
inactivation, excretion rates of the conjugate, the dosage schedule, 
amount administered, particular formulation as well as other factors 

25 known to those of skill in the art. 

The compositions are intended to be administered by a suitable 
route, including orally, parenterally, rectally, topically and locally. For oral 
administration, capsules and tablets are generally employed. The 
compositions are in liquid, semi-liquid or solid form and are formulated in 
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a manner suitable for each route of administration. Exemplary modes of 
administration include parenteral and oral modes of administration. 

Solutions or suspensions used for parenteral, intradermal, 
subcutaneous, or topical application can include any of the following 
5 components: a sterile diluent, such as water for injection, saline solution, 
fixed oil, polyethylene glycol, glycerine, propylene glycol or other 
synthetic solvent; antimicrobial agents, such as benzyl alcohot and methyl 
parabens; antioxidants, such as ascorbic acid and sodium bisulfite; 
chelating agents, such as ethylenediaminetetraacetic acid (EDTA); 

10 buffers, such as acetates, citrates and phosphates; and agents for the 
adjustment of tonicity such as sodium chloride or dextrose. Parenteral 
preparations can be enclosed in ampules, disposable syringes or single or 
multiple dose vials made of glass, plastic or other suitable material. 

In instances in which the conjugates exhibit insufficient solubility, 

15 methods for sotubilizing conjugates can be used. Such methods are 
known to those of skill in this art, and include, but are not limited to, 
using cosolvents, such as dimethylsulfoxide (DMSO), using surfactants, 
such as TWEEN®, or dissolution in aqueous sodium bicarbonate. 
Derivatives of the conjugates also can be used in formulating effective 

20 pharmaceutical compositions. 

Upon mixing or addition of the conjugate(s), the resulting mixture 
can be a solution, suspension, emulsion or the like. The form of the 
resulting mixture depends upon a number of factors, including the 
Intended mode of administration and the solubility of the conjugate in the 

25 selected carrier or vehicle. The effective concentration is sufficient for 
ameliorating the symptoms of the disease, disorder or condition treated 
and can be empirically determined. 

The pharmaceutical compositions are provided for administration to 
humans and animals in unit dosage forms, such as tablets, capsules, pills. 
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powders, granules, sterile parenteral solutions or suspensions, and oral 
solutions or suspensions, and oil-water emulsions containing suitable 
quantities of the conjugates or pharmaceutically acceptable derivatives 
thereof. The conjugates and derivatives thereof are typically formulated 
5 and administered in unit-dosage forms or multiple-dosage forms. 

Unit-dose forms as used herein refers to physically discrete units suitable 
for human and animal subjects and packaged individually as is known in 
the art. Each unit-dose contains a predetermined quantity of the 
conjugate sufficient to produce the desired therapeutic effect, in 

10 association with the required pharmaceutical carrier, vehicle or diluent. 
Examples of unit-dose forms include ampoules and syringes and 
individually packaged tablets or capsules. Unit-dose forms can be 
administered in fractions or multiples thereof. A multiple-dose form is a 
plurality of identical unit-dosage forms packaged in a single container to 

15 be administered in segregated unit-dose form. Examples of multiple-dose 
forms include vials, bottles of tablets or capsules or bottles of pints or 
gallons. Hence, multiple dose form is a multiple of unit-doses which are 
not segregated in packaging. 

The composition can contain along with the conjugate: a diluent 

20 such as lactose, sucrose, dicalcium phosphate, or carboxymethyl- 

celluiose; a lubricant, such as magnesium stearate, calcium stearate and 
talc; and a binder such as starch, natural gums, such as gum acacia- 
gelatin, glucose, molasses, polyvinylpyrrolidine, celluloses and derivatives 
thereof, povidone, crospovidones and other such binders known to those 

25 of skill in the art. Liquid pharmaceutically administrable compositions 
can, for example, be prepared by dissolving, dispersing, or otherwise 
mixing a conjugate as defined above and optional pharmaceutical 
adjuvants in a carrier, such as, for example, water, saline, aqueous 
dextrose, glycerol, glycols, ethanol, and the like, to thereby form a 
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solution or suspension. If desired, the pharmaceuticai composition to be 
administered can also contain minor amounts of nontoxic auxiliary 
substances sucli as wetting agents, emulsifying agents, or solubilizing 
agents, pH buffering agents and the iike, for example, acetate, sodium 
5 citrate, cyclodextrine derivatives, sorbitan monolaurate, triethanolamine 
sodium acetate, triethanolamine oleate, and other such agents. Actual 
methods of preparing such dosage forms are known, or will be apparent, 
to those skilled in this art; for example, see Remington's Pharmaceutical 
Sciences, Mack Publishing Company, Easton, Pa., 15th Edition, 1975. 
10 The composition or formulation to be administered will, in any event, 
contain a quantity of the conjugate in an amount sufficient to alleviate 
the symptoms of the treated subject. 

Dosage forms or compositions containing active ingredient in the 
range of 0.005% to 100% with the balance made up from non-toxic 
15 carrier can be prepared. For oral administration, a pharmaceutically 

acceptable non-toxic composition is formed by the incorporation of any of 
the normally employed excipients, such as, for example pharmaceutical 
grades of mannitol, lactose, starch, magnesium stearate, talcum, 
cellulose derivatives, sodium crosscarmellose, glucose, sucrose, 
20 magnesium carbonate or sodium saccharin. Such compositions include 
solutions, suspensions, tablets, capsules, powders and sustained release 
formulations, such as, but not limited to, implants and microencapsulated 
delivery systems, and biodegradable, biocompatible polymers, such as 
collagen, ethylene vinyl acetate, polyanhydrides, polyglycolic acid, 
25 polyorthoesters, polylactic acid and others. Methods for preparation of 
these compositions are known to those skilled In the art. The 
contemplated compositions can contain 0.001 %-1 00% active ingredient, 
such as 0.1-85%, for example 75-95%. 
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The conjugates or pharmaceutically acceptable derivatives can be 
prepared with carriers that protect the conjugate against rapid elimination 
from the body, such as time release formulations or coatings. The 
compositions can include other conjugates to obtain desired combinations 
of properties. The conjugates provided herein, or pharmaceutically 
acceptable derivatives thereof as described herein, also can be 
advantageously administered for therapeutic or prophylactic purposes 
together with another pharmacological agent known in the general art to 
be of value in treating one or more of the diseases or medical conditions 
referred to hereinabove, such as diseases or disorders associated with 
undesired and/or uncontrolled angiogenesis or neovascularization. It is to 
be understood that such combination therapy constitutes a further aspect 
of the compositions and methods of treatment provided herein. 

1 . Compositions for oral administration 

Oral pharmaceutical dosage forms are either solid, gel or liquid. The 
solid dosage forms are tablets, capsules, granules, and bulk powders. 
Types of oral tablets include compressed, chewable lozenges and tablets 
which can be enteric-coated, sugar-coated or film-coated. Capsules can 
be hard or soft gelatin capsules, while granules and powders can be 
provided in non-effervescent or effervescent form with the combination 
of other ingredients known to those skilled in the art. 

In certain embodiments, the formulations are solid dosage forms, 
such as, for example, capsules or tablets. The tablets, pills, capsules, 
troches and other dosage forms can contain, for example, any of the 
following ingredients, or compounds of a similar nature: a binder; a 
diluent; a disintegrating agent; a lubricant; a glidant; a sweetening agent; 
and a flavoring agent. 

Examples of binders include microcrystalline cellulose, gum 
tragacanth, glucose solution, acacia mucilage, gelatin solution, sucrose 
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and starch paste. Lubricants include talc, starch, magnesium or calcium 
stearate, lycopodium and stearic acid. Diluents include, for example, 
lactose, sucrose, starch, kaolin, salt, mannitol and dicalcium phosphate. 
Glidants include, but are not limited to, colloidal silicon dioxide. 
5 Disintegrating agents include crosscarmellose sodium, sodium starch 
glycolate, alginic acid, corn starch, potato starch, bentonite, 
methylcellulose, agar and carboxymethylcellulose. Coloring agents 
include, for example, any of the approved certified water soluble FD and 
C dyes, mixtures thereof; and water insoluble FD and C dyes suspended 
10 on alumina hydrate. Sweetening agents include sucrose, lactose, mannitol 
and artificial sweetening agents such as saccharin, and any number of 
spray dried flavors. Flavoring agents include natural flavors extracted 
from plants such as fruits and synthetic blends of conjugates which 
produce a pleasant sensation, such as, but not limited to peppermint and 
15 methyl salicylate. Wetting agents include propylene glycol monostearate, 
sorbitan monooleate, diethylene glycol monolaurate and polyoxyethylene 
laura! ether. Emetic-coatings include fatty acids, fats, waxes, shellac, 
ammoniated shellac and cellulose acetate phthalates. Film coatings 
include hydroxyethylcellulose, sodium carboxymethylcellulose, 
20 polyethylene glycol 4000 and cellulose acetate phthalate. 

If oral administration Is desired, the conjugate could be provided in 
a composition that protects it from the acidic environment of the 
stomach. For example, the composition can be formulated in an enteric 
coating that maintains its integrity in the stomach and releases the 
25 conjugate in the intestine. The composition also can be formulated in 
combination with an antacid or other such ingredient. 

When the dosage unit form is a capsule, it can contain, in addition 
to material of the above type, a liquid carrier such as a fatty oil. In 
addition, dosage unit forms can contain various other materials which 
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modify the physical form of the dosage unit, for example, coatings of 
sugar and other enteric agents. The conjugates also can be administered 
as a component of an elixir, suspension, syrup, wafer, sprinkle, chewing 
gum or the like. A syrup can contain, in addition to the conjugates, 
5 sucrose as a sweetening agent and certain preservatives, dyes and 
colorings and flavors. 

The conjugates also can be mixed with other active materials 
which do not impair the desired action, or with materials that supplement 
the desired action, such as antacids, H2 blockers, and diuretics. Higher 
10 concentrations, up to about 98% by weight of the conjugate can be 
included. 

Pharmaceutically acceptable carriers included in tablets are binders, 
lubricants, diluents, disintegrating agents, coloring agents, flavoring 
agents, and wetting agents. Enteric-coated tablets, because of the 

15 enteric-coating, resist the action of stomach acid and dissolve or 

disintegrate in the neutral or alkaline intestines. Sugar-coated tablets 
are compressed tablets to which different layers of pharmaceutically 
acceptable substances are applied. Film-coated tablets are 
compressed tablets which have been coated with a polymer or other 

20 suitable coating. Multiple compressed tablets are compressed tablets 
made by more than one compression cycle utilizing the pharmaceutically 
acceptable substances previously mentioned. Coloring agents also can be 
used in the above dosage forms. Flavoring and sweetening agents are 
used in compressed tablets, sugar-coated, multiple compressed and 

25 chewable tablets. Flavoring and sweetening agents are especially useful 
In the formation of chewable tablets and lozenges. 

Liquid oral dosage forms include aqueous solutions, emulsions, 
suspensions, solutions and/or suspensions reconstituted from 
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non-effervescent granules and effervescent preparations reconstituted 
from effervescent granules. Aqueous solutions include, for example, 
elixirs and syrups. Emulsions are either oil-in-water or water-in-oil. 
Elixirs are clear, sweetened, hydroalcoholic preparations. 
5 Pharmaceutically acceptable carriers used in elixirs include solvents. 
Syrups are concentrated aqueous solutions of a sugar, for example, 
sucrose, and can contain a preservative. An emulsion is a two-phase 
system in which one liquid is dispersed in the form of small globules 
throughout another liquid. Pharmaceutically acceptable carriers used in 

10 emulsions are non-aqueous liquids, emulsifying agents and preservatives. 
Suspensions use pharmaceutically acceptable suspending agents and 
preservatives. Pharmaceutically acceptable substances used in 
non-effervescent granules, to be reconstituted into a liquid oral dosage 
form, include diluents, sweeteners and wetting agents. Pharmaceutically 

15 acceptable substances used in effervescent granules, to be reconstituted 
into a liquid oral dosage form, include organic acids and a source of car- 
bon dioxide. Coloring and flavoring agents are used in all of the above 
dosage forms. 

Solvents include glycerin, sorbitol, ethyl alcohol and syrup. 

20 Examples of preservatives include glycerin, methyl and jaropylparaben, 
benzoic add, sodium benzoate and alcohol. Examples of non-aqueous 
liquids utilized in emulsions include mineral oil and cottonseed oil. 
Examples of emulsifying agents include gelatin, acacia, tragacanth, 
bentonite, and surfactants such as polyoxyethylene sorbitan monooleate. 

25 Suspending agents include sodium carboxymethylcellulose, pectin, 

tragacanth, Veegum and acacia. Diluents include lactose and sucrose. 
Sweetening agents include sucrose, syrups, glycerin and artificial 
sweetening agents such as saccharin. Wetting agents include propylene 
glycol monostearate, sorbitan monooleate, diethylene glycol monolaurate 
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and polyoxyethylene lauryl ether. Organic adds include citric and tartaric 
acid. Sources of carbon dioxide include sodium bicarbonate and sodium 
carbonate. Coloring agents include any of the approved certified water 
soluble FD and C dyes, and mixtures thereof. Flavoring agents include 
5 natural flavors extracted from plants such fruits, and synthetic blends of 
conjugates which produce a pleasant taste sensation. 

For a solid dosage form, the solution or suspension, in for example 
propylene carbonate, vegetable oils or triglycerides, for example the 
formulation can be encapsulated in a gelatin capsule. Such solutions, and 
10 the preparation and encapsulation thereof, are disclosed in U.S. Patent 
Nos 4,328,245; 4,409,239; and 4,410,545. For a liquid dosage form, 
the solution, e.g., for example, in a polyethylene glycol, can be diluted 
with a sufficient quantity of a pharmaceutically acceptable liquid carrier, 
e.g., water, to be easily measured for administration. 
15 Alternatively, liquid or semi-solid oral formulations can be prepared 

by dissolving or dispersing the conjugate or derivative thereof in 
vegetable oils, glycols, triglycerides, propylene glycol esters (e.g., 
propylene carbonate) and other such carriers, and encapsulating these 
solutions or suspensions in hard or soft gelatin capsule shells. Other 
20 useful formulations include those set forth in U.S. Patent Nos, Re 28,819 
and 4,358,603. Briefly, such formulations include, but are not limited to, 
those containing a conjugate provided herein, a dialkylated mono- or poly- 
alkylene glycol. Including, but not limited to, 1 ,2-dimethoxy methane, 
diglyme, triglyme, tetraglyme, polyethylene glycol-350-dimethyl ether, 
25 polyethylene glycol-550-dimethyl ether, polyethylene glycol-750-dimethyl 
ether wherein 350, 550 and 750 refer to the approximate average 
molecular weight of the polyethylene glycol, and one or more 
anitoxidants, such as butylated hydroxytoluene (BHT), butylated 
hydroxyanisole (BHA), propyl gallate, vitamin E, hydroquinone, 
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hydroxycoumarins, ethanolamine, lecithin, cephalin, ascorbic acid, malic 
acid, sorbitol, phosphoric acid, thiodipropionic acid and its esters, and 
dithiocarbamates. 

Other formulations include, but are not limited to, aqueous 
5 alcoholic solutions including a pharmaceutically acceptable acetal. 

Alcohols used in these formulations are any pharmaceuticafly acceptable 
water-miscible solvents having one or more hydroxy! groups, including, 
but not limited to, propylene glycol and ethanol. Acetals include, but are 
not limited to, di{lower alkyi) acetals of lower alkyl aldehydes such as 
0 acetaldehyde diethyl acetal. 

In all embodiments, tablets and capsules formulations can be 
coated as known by those of skill in the art in order to modify or sustain 
dissolution of the conjugate. Thus, for example, they can be coated with 
a conventional enterically digestible coating, such as phenylsalicylate, 
5 waxes and cellulose acetate phthalate. 

2. Injectables, solutions and emulsions 
Parenteral administration, generally characterized by injection, 
either subcutaneously, intramuscularly or intravenously also is 
contemplated herein. Injectables can be prepared in conventional forms, 
0 either as liquid solutions or suspensions, solid forms suitable for solution 
or suspension in liquid prior to injection, or as emulsions. Suitable 
excipients are, for example, water, saline, dextrose, glycerol or ethanol. 
In addition. If desired, the pharmaceutical compositions to be 
administered can also contain minor amounts of non-toxic auxiliary 
5 substances such as wetting or emulsifying agents, pH buffering agents, 
stabilizers, solubility enhancers, and other such agents, such as for 
example, sodium acetate, sorbltan monolaurate, trtethanolamine oleate 
and cyclodextrlns. Implantation of a slow-release or sustained-release 
system, such that a constant level of dosage is maintained (see, e.g., 



wo 02/095007 



PCTAJS02/16819 



-176- 

U.S. Patent No. 3,710,795) also is contemplated herein. Briefly, a 
conjugate provided herein is dispersed in a solid inner matrix, e.g., 
polymethylmethacrylate, polybutylmethacrylate, plasticized or 
unplasticized polyvinylchloride, plasticized nylon, plasticized 
5 polyethyleneterephthalate, natural rubber, polyisoprene, polyisobutylene, 
polybutadiene, polyethylene, ethylene-vinylacetate copolymers, silicone 
rubbers, polydimethylsiloxanes, silicone carbonate copolymers, 
hydrophilic polymers such as hydrogels of esters of acrylic and 
methacrylic acid, collagen, cross-linked polyvinylalcohol and cross-linked 

10 partially hydrolyzed polyvinyl acetate, that is surrounded by an outer 
polymeric membrane, e.g., polyethylene, polypropylene, 
ethylene/propylene copolymers, ethylene/ethyl acrylate copolymers, 
ethylene/vinylacetate copolymers, silicone rubbers, polydimethyl 
siloxanes, neoprene rubber, chlorinated polyethylene, polyvinylchloride, 

15 vinylchloride copolymers with vinyl acetate, vinylidene chloride, ethylene 
and propylene, ionomer polyethylene terephthalate, butyl rubber 
epichlorohydrin rubbers, ethylene/vinyl alcohol copolymer, ethylene/vinyl 
acetate/vinyl alcohol terpolymer, and ethylene/vinyloxyethanol copolymer, 
that is insoluble in body fluids. The conjugate diffuses through the outer 

20 polymeric membrane in a release rate controlling step. The percentage of 
conjugate contained in such parenteral compositions is highly dependent 
on the specific nature thereof, as well as the activity of the conjugate 
and the needs of the subject. 

Parenteral administration of the compositions includes intravenous, 

25 subcutaneous and intramuscular administrations. Preparations for 
parenteral administration include sterile solutions ready for injection, 
sterile dry soluble products, such as lyophilized powders, ready to be 
combined with a solvent just prior to use, including hypodermic tablets, 
sterile suspensions ready for injection, sterile dry insoluble products ready 
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to be combined with a vehicle just prior to use and sterile emulsions. The 
solutions can be either aqueous or nonaqueous. 

If administered intravenously, suitable carriers include physiological 
saline or phosphate buffered saline (PBS), and solutions containing 
5 thickening and solubilizing agents, such as glucose, polyethylene glycol, 
and polypropylene glycol and mixtures thereof. 

Pharmaceutically acceptable carriers used in parenteral preparations 
include aqueous vehicles, nonaqueous vehicles, antimicrobial agents, 
isotonic agents, buffers, antioxidants, local anesthetics, suspending and 
10 dispersing agents, emulsifying agents, sequestering or chelating agents 
and other pharmaceutically acceptable substances. 

Examples of aqueous vehicles include Sodium Chloride Injection, 
Ringers Injection, Isotonic Dextrose Injection, Sterile Water Injection, 
Dextrose and Lactated Ringers Injection. Nonaqueous parenteral vehicles 
15 include fixed oils of vegetable origin, cottonseed oil, corn oil, sesame oil 
and peanut oil. Antimicrobial agents in bacteriostatic or fungistatic 
concentrations must be added to parenteral preparations packaged in 
multiple-dose containers which include phenols or cresols, mercurials, 
benzyl alcohol, chlorobutanol, methyl and propyl p-hydroxybenzoic acid 
20 esters, thimerosal, benzalkonium chloride and benzethonium chloride. 
Isotonic agents include sodium chloride and dextrose. Buffers include 
phosphate and citrate. Antioxidants include sodium bisulfate. Local 
anesthetics include procaine hydrochloride. Suspending and dispersing 
agents include sodium carboxymethylcelluose, hydroxypropyl 
25 methylcellulose and polyvinylpyrrolidone. Emulsifying agents include 

Polysorbate 80 (TWEEN® 80). A sequestering or chelating agent of metal 
ions include EDTA. Pharmaceutical carriers also include ethyl alcohol, 
polyethylene glycol and propylene glycol for water miscible vehicles and 
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sodium hydroxide, hydrochloric acid, citric acid or lactic acid for pH 
adjustment. 

The concentration of the conjugate is adjusted so that an injection 
provides an effective annount to produce the desired pharmacological 
5 effect. The exact dose depends on the age, weight and condition of the 
patient or animal as is known in the art. 

The unit-dose parenteral preparations are packaged in an ampule, a 
vial or a syringe with a needle. All preparations for parenteral 
administration must be sterile, as is known and practiced in the art. 
10 Illustratively, intravenous or intraarterial infusion of a sterile 

aqueous solution containing a conjugate is an effective mode of 
administration. Another embodiment is a sterile aqueous or oily solution 
or suspension containing a conjugate injected as necessary to produce 
the desired pharmacological effect. 
15 Injectables are designed for local and systemic administration. 

Typically a therapeutically effective dosage is formulated to contain a 
concentration of at least about 0.1 % w/w up to about 90% w/w or 
more, genrally more than 1 % w/w of the conjugate to the treated 
tissue(s). The conjugate can be administered at once, or can be divided 
20 into a number of smaller doses to be administered at intervals of time. It 
is understood that the precise dosage and duration of treatment is a 
function of the tissue being treated and can be determined empirically 
using known testing protocols or by extrapolation from in vivo or in vitro 
test data. It is to be noted that concentrations and dosage values can 
25 also vary with the age of the individual treated. It Is to be further 
understood that for any particular subject, specific dosage regimens 
should be adjusted over time according to the individual need and the 
professional judgment of the person administering or supervising the 
administration of the formulations, and that the concentration ranges set 
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forth herein are exemplary only and are not intended to limit the scope or 
practice of the claimed formulations. 

The conjugate can be suspended in micronized or other suitable 
form or can be derivatized to produce a more soluble product. The form 
5 of the resulting mixture depends upon a number of factors, including the 
intended mode of administration and the solubility of the conjugate in the 
selected carrier or vehicle. The effective concentration is sufficient for 
ameliorating the symptoms of the condition and can be empirically 
determined. 
10 3. Lyophilized powders 

Of interest herein are also lyophilized powders, which can be 
reconstituted for administration as solutions, emulsions and other 
mixtures. They also can be reconstituted and formulated as solids or 
gels. 

15 The sterile, lyophilized powder is prepared by dissolving a 

conjugate provided herein, or a pharmaceutically acceptable derivative 
thereof, in a suitable solvent. The solvent can contain an excipient which 
improves the stability or other pharmacological component of the powder 
or reconstituted solution, prepared from the powder. Excipients that can 

20 be used include, but are not limited to, dextrose, sorbital, fructose, corn 
syrup, xylitol, glycerin, glucose, sucrose or other suitable agent. The 
solvent can also contain a buffer, such as citrate, sodium or potassium 
phosphate or other such buffer known to those of skill in the art at, 
typically, about neutral pH. Subsequent sterile filtration of the solution 

25 followed by lyophilization under standard conditions known to those of 
skill in the art provides the desired formulation. Generally, the resulting 
solution will be apportioned into vials for lyophilization. Each vial will 
contain a single dosage (such as 10-1000 mg, for example 100-500 mg) 
or multiple dosages of the conjugate. The lyophilized powder can be 
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stored under appropriate conditions, such as at about 4 °C to room 
tennperature. 

Reconstitution of this lyophilized powder with water for injection 
provides a formulation for use in parenteral administration. For 
5 reconstitution, generally about 1-50 mg, such 5-35 mg or about 9-30 mg 
of lyophilized powder, is added per mL of sterile water or other suitable 
carrier. The precise amount depends upon the selected conjugate, 
intended subject, and other empircally determinable parameters. Hence 
the amount can be empirically determined. 

10 4. Topical administration 

Topical mixtures are prepared as described for the local and 
systemic administration. The resulting mixture can be a solution, 
suspension, emulsions or the like and are formulated as creams, gels, 
ointments^ emulsions, solutions, elixirs, lotions, suspensions, tinctures, 

15 pastes, foams, aerosols, irrigations, sprays, suppositories, bandages, 
dermal patches or any other formulations suitable for topical 
administration. 

The conjugates or pharmaceutically acceptable derivatives thereof 
can be formulated as aerosols for topical application, such as by 

20 inhalation (see, e,g,, U.S. Patent Nos. 4,044,126, 4,414,209, and 

4,364,923, which describe aerosols for delivery of a steroid useful for 
treatment of inflammatory diseases, particularly asthma). These 
formulations for administration to the respiratory tract can be in the form 
of an aerosol or solution for a nebulizer, or as a microfine powder for 

25 insufflation, alone or in combination with an inert carrier such as lactose. 
In such a case, the particles of the formulation will typically have 
diameters of less than 50 microns, generally less than 10 microns. 

The conjugates can be formulated for local or topical application, 
such as for topical application to the skin and mucous membranes, such 
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as in the eye, in the form of gels, creams, and lotions and for application 
to the eye or for intracisternal or intraspinal application. Topical 
administration is contemplated for transdermal delivery and also for 
administration to the eyes or mucosa, or for inhalation therapies. Nasal 
5 solutions of the conjugate alone or in combination with other 
pharmaceutically acceptable excipients also can be administered. 

These solutions, particularly those intended for ophthalmic use, can 
be formulated as 0.01 % - 10% isotonic solutions, pH about 5-7, with 
appropriate salts. 
10 5. Compositions for other routes of administration 

Other routes of administration, such as topical application, 
transdermal patches, and rectal administration are also contemplated 
herein. 

For example, pharmaceutical dosage forms for rectal administration 
1 5 are rectal suppositories, capsules and tablets for systemic effect. Rectal 
suppositories are used herein mean solid bodies for insertion into the 
rectum which melt or soften at body temperature releasing one or more 
conjugates. Pharmaceutically acceptable substances utilized in rectal 
suppositories are bases or vehicles and agents to raise the melting point. 
20 Examples of bases include cocoa butter (theobroma oil), glycerin-gelatin, 
carbowax (polyoxyethylene glycol) and appropriate mixtures of mono-, di- 
and triglycerides of fatty acids. Combinations of the various bases can be 
used. Agents to raise the melting point of suppositories include 
spermaceti and wax. Rectal suppositories can be prepared either by the 
25 compressed method or by molding. The typical weight of a rectal 
suppository is about 2 to 3 gm. 

Tablets and capsules for rectal administration are manufactured 
using the same pharmaceutically acceptable substance and by the same 
methods as for formulations for oral administration. 
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6. Articles of manufacture 

The conjugates or pharmaceutically acceptable derivatives can be 
packaged as articles of manufacture containing packaging material, a 
conjugate or pharmaceutically acceptable derivative thereof provided 
5 herein, w/hich is used for treatment, prevention or amelioration of one or 
more symptoms associated with proliferative diseases or disorders, and a 
label that indicates that the conjugate or pharmaceutically acceptable 
derivative thereof is used for treatment, prevention or amelioration of one 
or more symptoms associated with proliferative diseases or disorders. 

10 The articles of manufacture provided herein contain packaging 

materials. Packaging materials for use in packaging pharmaceutical 
products are well known to those of skill in the art. See, e.g., U.S. 
Patent Nos, 5,323,907, 5,052,558 and 5,033,252. Examples of 
pharmaceutical packaging materials include, but are not limited to, blister 

15 packs, bottles, tubes, inhalers, pumps, bags, vials, containers, syringes, 
bottles, and any packaging material suitable for a selected formulation 
and intended mode of administration and treatment. A wide array of 
formulations of the conjugates and compositions provided herein are 
contemplated as are a variety of treatments for any disorder In which a 

20 cell surface protease, or a soluble, shed or secreted form thereof, is 
implicated. 

F. Evaluation of the activity of the conjugates 

Standard physiological, pharmacological and biochemical 
procedures are available for testing the conjugates to identify those that 
25 possess therapeutic activity upon action of a cell surface protease or a 
soluble, shed, or released form thereof, fn vitro and in vivo assays that 
can be used to evaluate therapeutic activity, such as cytotoxicity, of the 
conjugates will depend upon the therapeutic agent being tested. 
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Exemplary assays are discussed briefly below with reference to 
cytotoxic conjugates (see, also. Examples). It is understood that the 
particular activity assayed will depend upon the conjugated therapeutic 
agent. 

5 1 . //? vitro Assays 

The therapeutic activity, such as cytotoxicity, of the conjugates 
provided herein can assessed by any assays normally used for assessing 
the therapeutic activity, such as cytotoxicity, of the unconjugated 
therapeutic agent. Numerous such assays are known, for example, 

10 assays can employ cells that express the targeted cell surface protease 
and the therapeutic activity of the therapeutic agent is assessed. For 
example, cytoxicity can be assessed by measuring cell viability or by 
measuring cell proliferation, such as by incorporation of a labeled 
nucleotide or other such label. Generally the activity is compared with 

15 cells that do not express the targeted protease. 

For example, the cells will be any that express a targeted MTSP or 
endothellase. Such cells can be obtained by choosing cells known to 
express the cell surface protease, such as by determining tissue 
expression profiles, as discussed above, or by screening a variety of cell 

20 lines with an antibody for a targeted protease, or for the protease activity 
in the presence of a labeled, such as a chromogenic, substrate for the 
protease in the presence and absence of a known inhibitor of the targeted 
protease. 

Alternatively, nucleic acid encoding the protease can be introduced 
25 in a cell line that does not express the protease, and expressed therein to 
produce a cell line that expresses the protease of interest. The resulting 
recombinant cells can be used in cytotoxicity assays. 
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2. in vivo Assays 

Numerous animal models for assessing therapeutic activity are 
known. Any suitable in vivo model can be used. Exemplary are the 
mouse xenograft model and chicken embryo models. 
5 Chicken Embryo Model 

The CAM model (chick embryo chorioallantoic membrane model; 
Ossowski (1988) J. Ceil Biol. 707:2437-2445), provides another method 
for evaluating the inhibitory activity of a test compound. In the CAM 
model, tumor cells invade through the chorioallantoic membrane 

10 containing CAM (with tumor cells in the presence of several serine 

protease inhibitors results in less or no invasion of the tumor cells through 
the membrane). Thus, the CAM assay is performed with CAM and tumor 
cells in the presence and absence of various concentrations of test 
compound. The invasiveness of tumor cells is measured under such 

15 conditions to provide an indication of the compound's inhibitory activity. 
A compound having inhibitory activity correlates with less tumor invasion. 

Thus, the CAM assay is performed with CAM and tumor cells in 
the presence and absence of various concentrations of a test compound. 
A compound having activity correlates with a change in tumor invasion 

20 and/or tumor growth. 

For example, the ability of a cell surface protease to liberate a 
therapeutic agent, such as a cytotoxic agent, or the activity of a 
conjugate agent can be assessed using this model. If the therapeutic 
agent is released from the compound and it is an inhibitory agent there 

25 will be less tumor invasion or a decrease in size of the tumor. If the 
therapeutic agent is inactive in the conjugate, there will be no effect on 
tumor invasion. 
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The CAM model also is used in a standard assay of angiogenesis 
{i.e., effect on formation of new blood vessels (Brooks et a/. (1991) 
Methods in Molecular Biology 725:257-269). According to this model, a 
filter disc containing an angiogenesis inducer, such as basic fibroblast 
5 growth factor (bFGF) is placed onto the CAM. Diffusion of the cytokine 
into the CAM induces local angiogenesis, which can be measured in 
several ways such as by counting the number of blood vessel branch 
points within the CAM directly below the filter disc. The ability of 
identified compounds to inhibit cytokine-induced angiogenesis can be 

10 tested using this model. A test compound can either be added to the 
filter disc that contains the angiogenesis inducer, be placed directly on 
the membrane or be administered systemicaily. The extent of new blood 
vessel formation in the presence and/or absence of test compound can be 
compared using this model. The formation of fewer new blood vessels in 

15 the presence of a test compound would be indicative of anti-angiogenesis 
activity. 

This can be adapted for use with the conjugates herein to 1) 
assess the activity of a therapeutic agent in the conjugate; and 2) to 
assess the ability of a particular cell surface protease to liberate a 
20 therapeutic agent from a conjugate. 

Mouse xenograft model 
In vivo activity can be a assessed using recognized animal 
models, such as the well-known mouse xenograft model for anti-tumor 
activity (see, e.g., Beitz et aL (1992) Cancer Research 52:227-230; 
25 Houghton et al. (1982) Cancer Res, 42:535-539; Bogden ©f a/. (1981) 
Cancer (Philadelphia) 48'A0'20; Hoogenhout a/. (1983) Int. J. Radlat, 
Oncol., Biol. Phys. 5:871-879; Stastny et aL (1993) Cancer Res. 
55:5740-5744). The in vivo mouse solid tumor xenograft model is used 
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in assays for that test an agent's ability to inhibit tumor cell proliferation 
and/or spontaneous metastasis. For example, a conjugate is evaluated 
for anti-tumor activity against any tumor subtype that expresses the 
targeted cell surface protease, e.g., an ovarian tumor, in a mouse tumor 
5 xenograft model. Nude mice are given one or more, such as four 

intravenous injections of the conjugate. Dosing material is prepared by 
mixing the test material with appropriate volumes of, for example, 
PBS/0.1% BSA to achieve the desired doses. Mice IV injections (250- 
300 ul) into the tail vein for the duration of the experiment, such as, for 

10 example, days 5, 12, 19 and 26, with day 1 designated as the day that 
the tumor cells are injected into the mice. Doses are either fixed or 
normalized for differences in body weight. Tumor volume is measured 
twice weekly for a selected period. 

Female Balb/c nu/nu athymic mice (Roger Williams Hospital Animal 

15 Facility, Providence, Rl), 8-12 weeks old are suitable mice. They should 
be maintained in an aseptic environment and selected such that body 
weights range from about 25-30 grams the day prior to dosing. Animals 
are maintained in a quarantined room and handled under aseptic 
conditions. Food and water are supplied ad libitum. Appropriate tumor 

20 cells can be obtained, for example, from the American Type Culture 
Collection (Rockville, MD) and grown in modified Eagle's medium 
supplemented with 10% fetal calf serum. A selected number of days, 
such as five days prior to Injection of the test material, mice receive a 
subcutaneous injection of tumor cells in the right rear flank. 

25 Calipers are used to measure the dimensions of each tumor. 

Measurements (mm) of maximum and minimum width are performed prior 
to injection of the test material and at selected, such as bi-weekly, 
intervals for the duration of the experiment. Tumor volumes (mm^) can 
be computed, for example, using the formula: 
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Volume = [(width)^(length)]/2. 

G. Methods for Identifying proteases to target 

Also provided are methods for identifying proteases to target 
conjugates for treatment of diseases. Thie methods involve identifying 
5 cell-surface protease-associated disease by identifying a cell involved in 
the disease process or a cell in the vicinity of the cell involved in the 
disease process. For example, if disease involves a particular tumor, a 
protease present on the particular tumor or on cells that a located in the 
vicinity thereof is identified. A cell surface protease on the cell for 
10 targeting and substrates therefor are then identified. Conjugates that 
target such proteases as provided herein can then be prepared. 

The following examples are included for illustrative purposes only 
and are not intended to limit the scope of the invention. 
15 EXAMPLE 1 

General Procedures for Preparing Peptide-doxorubicin conjugates 

Step A: Synthesis of Peptides on Wang resin 

Peptides were prepared automatically using an ABI 431 A peptide 
synthesizer from Perseptive Biosystems on preloaded Wang resin {0.25 
20 mmol). The ABI 431 A uses HOBT, HBTU, DIEA activation. The 

synthesis of N-acetyi (or other amide) capped peptides involved the use 
of AcOH (or other respective carboxylic acid) during the final coupling 
step on the ABI 431 A. Other N-terminal caps where attached manually 
by using the following reagents: For carbamates and sulfonamides the 
25 peptides were capped with ROCOCI or RSOjCI and DIEA (4 equivalents 
each, 1 hr| in DMF (3 mL). 

Step B: Cleavage of peptides from Wang resin 

The cleavage of peptides from Wang resin involved shaking the 
resin with 2 mL TFA/HjO (95:5) for 45 min. The resin was removed by 
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filtration and the filtrate was allowed to stand for an additional hour. The 
solution was concentrated to a residue. The crude peptide was analyzed 
by analytical HPLC (system A). Typical purity of the crude peptide 
ranged from 80% to 95%. The peptides were purified by preparative 
5 HPLC (system B) using an appropriate gradient (typically 10-30%). Pure 
fractions were then lyophilized to provide the desired peptide as a white 
solid. Typical yields were 20-50% and a purity of 96-99%. 
Analytical HPLC conditions (System A) 

Column: Chromolith RP-18e 4.6 mm x 100 mm from EM science 
10 Gradient: 5-50% B in A over 6 min 

Flow Rate: 4 mL/min 

Solvent A: 0.1 % TFA in water 

Solvent B: 0.1 % TFA in acetonitrile 

Wavelength: 210 nm, 280 nm 

15 Preparative HPLC conditions (System B) 

Column: Ultro 1 20 5 CI 8Q 1 50 x 20 mm from Peeke Scientific 

Gradient: 0-20%, or 10-30% or 20-40% B in A over 40 min 

Flow Rate: 1 8 mL/min 

Solvent A: 0.1% TFA in water 
20 Solvent B: acetonitrile 

Wavelength: 214 nm 

Step C: Coupling of peptide acids to doxorubicin 

To a mixture of peptide acid (0.052 mmol, 1.2 equivalents), 

doxorubicin hydrochloride (0.043 mmol, 25 mg), and HATU (0.0604 
25 mmol, 22.9 mg, 1.4 equivalents) was added DMF (1 mL) then 2,6- 

lutidine (0.17 mmol, 20 //L, 4 eqiuvalents). The mixture was mixed until 

a homogeneous solution was obtained. After 4 to 24 hours (monitor by 

HPLC system A) the reaction was diluted with water (9 mL) and directly 

purified by preparative HPLC (system D). Pure fractions were then 
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lyophitized to provide the desired peptide doxorubicin conjugate as a 
fluffy red solid. The quality of the final conjugate was verified by 
analytical HPLC (system C) and mass spectroscopy. Typical yields were 
10-30% with a purity of 95-99%. {Note: when the peptide acid 
5 contained a histidine residue DIEA was substituted as the base and the 
reaction time was shortened to 1 hour). 

Deprotection of fluorenylmethylesters of peptide doxorubicin 
conjugates: In cases where free carboxylic acid is present in the 
conjugate a fluorenyl methyl ester was used to protect a carboxylic acid 
10 during coupling of the C-terminus of the peptide acid to doxorubicin, the 
flourenylmethyl group was subsequently removed with 10% morpholine 
in DMF for 1 hour. 

Analytical HPLC conditions (System C) 

Column: Chromolith RP-18e 4.6 mm x 100 mm from EM science 
15 Gradient: 5-50% B in A over 6 min 

Flow Rate: 4 mL/min 

Solvent A: 0.1% TFA in water 

Solvent B: 0.1% TFA in acetonitrile 

Wavelength: 210 nm, 280 nm 

20 Examples of retention times (min) 

Doxorubicin 4.05 

Ac-Gly-Ser-Gly-Arg-Ser-nLeu-Dox 4.34 

MeOCO-Thr-Gly-Arg-Ser-nLeu-Dox 4.39 

PhS02-Thr-Gly-Arg-Ser-nLeu-Dox 4.83 
25 N,N-dimethy1gIycine-Thr-Gly-Arg-Ser-nLeu-Dox 4.27 

Ac-Thr-Gly-Arg-Ser-nLeu-Dox 4.32 

Preparative HPLC conditions (System D) 

Column: Uitro 1 20 5 CI 8Q 1 50 x 20 mm from Peeke Scientific 
Gradient: 10-30% B in A over 40 min 
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Flow Rate: 



18 mL/mrn 



Solvent A: 



0,1% acetic acid in water 



Solvent B: 



acetonitrile 



Wavelength: 



214 nm 



EXAMPLE 2 



10 



15 



20 



25 



Preparation of Ac-GIy-Ser-Gly-Arg-Ser-nLeu-Dox 

Step A: Manual synthesis of Ac-Gly-Ser(tBu)-Gly-Arg(Pbf)- 
Ser(tBu)-nLeu-Wang resin 

In a 250 mL fritted peptide synthesis vessel equipped with nitrogen 
agitation and vacumm assisted drainage, Fnnoc-nL-Wang resin (nova- 
biochem, 3.3 grams, 0.9 mnnol/g, 3 mmo!) was pre-swelled for 30 min 
using DMF. The peptide was then elongated by repeating the 4 s.tep 
procedure below a total of five times with the following Fmoc 
aminoacids: Fmoc-Ser{tBu)-OH, Fmoc-Arg(Pbf)-OH, Fmoo-Gly-OH, Fmoc- 
Ser(tBu)-OH, Fmoc-Gly-OH. 

Iterative Coupling Procedure 

1 . the resin was mixed with 20% piperdine in DMF (100 mL) 
for 5 min then drained (repeat 3 times). 

2. the resin was agitated with DMF {100 mL) for 30 sec then 
drained (repeat 3 times). 

3. to a mixture of Fmoc-aminoacid (12 mmol), HOST (12 mmol, 
4 equivalents, 1.622 g), TBTU (1 1.7 mmo!, 3.9 equivalents, 3.757 
g), DMF (10 mL) and NMP (90 mL) was added DIEA (12 mmol, 4 
equivalents, 2.10 mL). After stirring for 5 min to allow pre- 
activation, the solution was added to the synthesis vessel. The 
reaction was checked for completion by ninhydrin test and then 
drained. (If the ninhydrid test was blue, a double coupling (repeat 
step 3) was performed. 

4. the resin was agitated with DMF (100 mL) for 30 sec then 
drained (repeat 3 times). 
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The elongated resin (Fmoc-Gly-Ser(tBu)-Gly-Arg{Pbf)-Ser{tBu}-nLeu- 
Wang resin) was treated to steps 1 and 2 above to remove the Fmoc 
group. A solution of acetic anhydride (15 mmol, 5 equivalents, 1 .42 mL), 
DIEA (15 mmol, 5 equivalents, 2.62 mL), DMF (10 mL) and NMP (90 mL) 
was added to the reaction vessel. After 1 hour the resin was washed 
with DMF (100 mL, 3 times), CH2CI2 (100 mL, 3 times) and MeOH (100 
mL, 3 times). The resin was dried under vacuum for 15 hours. 
Step B: Preparation of Ac-Gly-Ser-Gly-Arg-Ser-nLeu-OH 
To the above synthesis vessel containing Ac-Gly-Ser(tBu)-Gly- 
Arg{Pbf)-Ser(tBu)-nLeu-Wang resin (3 mmol) was added TFA/HjO (95:5, 
50 mL). After gently agitation for 45 min the cleavage solution was 
collected and the filtrate was allowed to stand for an additional 90 min. 
The solution was concentrated to a residue. The crude peptide was 
analyzed by analytical HPLC (system A, RT=1.73, purity = 90%). The 
residue was dissolved In water (50 mL) and hexanes (10 mL) and mixed. 
The hexanes layer was removed and the aqueous layer bubbled with 
nitrogen to evaporate any remaining hexanes. The crude peptide was 
purified by preparative HPLC (system E). Pure fractions were then 
lyophilized to provide Ac-Gly-Ser-Gly-Arg-Ser-nLeu-OH (1.04 g, 1.68 
mmol, 56%) as a white solid. The purity was evaluated by analytical 
HPLC (system A, RT= 1.73 min, 97% purity) and the constitution by 
mass spectrospcopy (ion observed at 617.9). 
Preparative HPLC conditions (System E) 

Column: Waters Delta-Pak radial compression column, 15 um, 100A 

Gradient: 5-15% B in A over 40 min 

Flow Rate: 80 mL/min 

Solvent A: 0.1 % acetic acid in water 

Solvent B: acetonitrile 

Wavelength: 214 nm 



wo 02/095007 



PCT/US02/16819 



-192- 

Step C: Preparation of Ac-Gly-Ser-Gly-Arg-Ser-nLeu-Dox 

To a mixture of Ac-Gly-Ser-Gly-Arg-Ser-nLeu-OH (1 .68 mmol, 1.04 
g, 1.1 equivalents), doxorubicin hydrociiloride (1.53 mmol, 887.8 mg), 
and HATU (1.76 mmol, 669.6 mg, 1.15 equfvalents) was added DMF (40 
5 mL) then 2,6-lutrdine (6.12 mmol, 709 pL, 4 eqiuvalents). The solution 
was stirred for 18 hours. The reaction was diluted with water (100 mL), 
acidified with acetic acid (400 jjL) and purified immediately in three 
batches by preparative HPLC (system E). Each red colored fraction was 
analyzed by analytical HPLC (system F). Fractions of greater than 95% 

10 purity were then combined. The acetonitrile was removed under vacuum 
and the remaining solution was lyophilized to provide Ac-Gly-Ser-Gly-Arg- 
Ser-nLeu-Dox (0.682 mmol, 780 mg, 45%) as a fluffy red solid. The 
purity was evaluated by analytical HPLC (system F, RT = 3.51 min, 95% 
purity) and the constitution by mass spectrospcopy (ion observed at 

15 1143.5). 

Analytical HPLC conditions (System F) 

Column: Chromolith RP-ISe 4.6 mm x 100 mm from EM science 
Gradient: 20-40% B in A over 6 min 
Flow Rate: 4 mL/min 
20 Solvent A: 0.1 % TFA in water 

Solvent B: 0.1% TFA in acetonitrile 
Wavelength: 210 nm, 280 nm 

Preparative HPLC conditions (System E) 

Column: Waters Delta-Pak radial compression column, 15 um, 100A 
25 Gradient: 15-25% B in A over 40 min 
Flow Rate: 80 mL/min 
Solvent A: 0.1 % acetic acid in water 
Solvent B: acetonitrile 
Wavelength: 214 nm 
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EXAMPLE 3 

General Procedures for Preparing Peptide-Taxol conjugates 
Step A: Synthesis of Peptides on Wang resin 
See Example 1, Step A. 
5 Step B: Cleavage of peptides from Wang resin 

See Example 1 , Step B. 

Step C: Coupling of peptide acids to 7-Gly-Taxol or 7-Ala-Taxol 

To a mixture of peptide acid (0.0121 mmol, 1.1 equivalents), 7- 
Gly-Taxol of 7-Ala-Taxol (0.01 1 mmol), and HATU (0.0154 mmol, 5.9 

10 mg, 1.4 equivalents) was added DMF (0.3 mL) then 2,6-lutidine (0.044 
mmol, 5.1 //L, 4 eqiuvalents). The mixture was mixed until a 
homogeneous solution was obtained. After 4 to 24 hours (monitor by 
HPLC system H) the reaction was diluted with water (9 mL) and directly 
purified by preparative HPLC (system I). Pure fractions were then 

15 lyophilized to provide the desired peptide taxol conjugate as a fluffy white 
solid. The quality of the final conjugate was verified by analytical HPLC 
(system H) and mass spectroscopy. Typical yields were 30-50% with a 
purity of 96-99%. 

Analytical HPLC conditions (System H) 
20 Column: Chromolith RP-18e 4.6 mm x 100 mm from EM science 

Gradient: 5-90% B in A over 6 min 

Flow Rate: 4 mL/min 

Solvent A: 0.1 % TFA in water 

Solvent B: 0.1 % TFA in acetonitrile 
25 Wavelength: 210 nm, 280 nm 

Examples of retention times (min) 

Ac-Gln-Ser-Arg-Ala-Ala-Taxol 2.86 

Ac-Gln-Ser-Arg-Ser-Ala-Ala-Taxol 2.79 

Ac-Ser-Gly-Arg-Ala-Ser-Ala-Taxol 2.87 
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10 



Ac-Arg-Ser-Arg-Ala-Ala-Taxol 2.80 
Ac-Ser-Gly-Arg-Ser-Ser-Ala-Taxol 2.81 
Preparative HPLC conditions (System 1) 

Column: Ultro 120 5 CI 8Q 150 x 20 mm from Peeke Scientific 
20-45% B in A over 40 min 
18 mL/min 
0.1 % TFA in water 
acetonitrile 

214 nm 

EXAMPLE 4 

Preparation of N-Ac-Arg-Gln-Ser-Arg-Ala-Ala-DOX 



Gradient: 
Flow Rate: 
Solvent A: 
Solvent B: 
Wavelength: 



OH 




HjN 




Step A: N-Ac-Arg-Gln-Ser-Arg-Ala-Ala-OH 

Using the following general procedure, the A/-acetyl peptidic 
15 substrate A/-Ac-Arg-Gln-Ser-Arg-Ala-Ala-OH was synthesized in a peptide 
synthesis flask. Commencing with commercial Fmoc-Ala-Wang resin 
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(0.35 g, 0.84 mmol. Nova), standard Fmoc-deprotection with 20% 
piperidine was followed by a sequential iterative coupling-Fmoc 
deprotection strategy. Each coupling employed a 3-fold excess (2.52 
mmol) of Fmoc-Ala, Fmoc-Arg{Boc)2, Fmoc-Ser(tBu), Fmoc-Gln(Trt) and 
5 Fmoc-Arg(Boc)2/ respectively. Couplings were achieved using PyBOP 

(2.52 mnnol) and DIEA (2.52 mmol) in DMF solvent. During each coupling 
cycle, the Fmoc protecting group was removed using 20% piperidine in 
DMF. After removal of the A/-terminal Fmoc group, capping with acetic 
anhydride (1.43 mmol, 1.7 equiv.), DMAP (0.25 mmol, 0.3 equiv.), and 

10 DIEA (1.26 mmole, 1 .5 equiv.) afforded the resin-bound A/-acetyl 

intermediate. The protected peptide resin was treated with 50% TFA in 
methylene chloride for 30 min to cleave the Wang resin and then the Boc, 
Trt and t-Bu protecting groups were removed with 70% TFA in methylene 
chloride. Solvent and other volatile byproducts were evaporated under 

15 reduced pressure and the crude product was dissolved in water and 

lyophilized to afford the title compound as a nearly colorless, amorphous 
solid. Mass spectral analysis confirmed the desired molecular weight. 
HPLC analysis indicated the product to be of approximately 95% purity. 
The peptide carboxylic acid intermediate can be further purified by 

20 trituration or by preparative HPLC, if desired. 

Step B: N-Ac-Arg-Gln-Ser-Arg-Ala-Ala-DOX 

The intermediate from Step A (20 mg, 0.027 mmol) was dissolved 
in dry DMF (0.8 mL) and was stirred at room temperature under a 
nitrogen atmosphere. To this solution was added doxorubicin 
25 hydrochloride (15.6 mg, 0.027 mmol), EDC (6.8 mg, 0.035 mmol), HOAt 
(4.8 mg, 0.035 mmol) and 2,6-lutrdine (7.3 //L, 0.06 mmol). Stirring was 
continued until completion of the coupling, as monitored by analytical 
HPLC (system J, see below). The solution was filtered and the crude 
product was purified by CIS RP-HPLC (A = 0.1% AcOH/HaO; B = CH3CN), 
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gradient elution 100% to 60% A over 60 min). Homogeneous product 
fractions (evaluated by HPLC, system J) were pooled and lyophilized to 
afford the title compound as a light red solid. 
HPLC conditions, system J: 
5 Column: Phenomenex 15 cm #00F-3033-E0, CI 8 

Eluant: Gradient 95:5 (A:B) to 25:75 (A:B) over 20 min. 

A = 0.1% TFA/H2O, B-0.1%TFA/Acetonitrile 
Flow: 1 mL/min. 

Wavelength: 210 nm, 280 nm 

10 Retention times: Doxorubicin = 8.89 min. 

N-Ac-Arg-Gln-Ser-Arg-Ala-Ala-Dox = 8.4 min. 

Physical Properties: 
Molecular Formula: C55H7gNi402o 
Molecular Weight: 1255.3 
15 Low Resolution Mass Spec: 628.2 (M + 2/2) 

Table 2 lists data for additional peptidic substrate-Doxorubrcin 
conjugates. These conjugates were prepared from the appropriate amino 
acid precursors that were elaborated by the general procedures described 
in Example 4. 

20 

TABLE 2 



Peptidic substrate-DOX Conjugate 


Mass 
Spectrum 


HPLC-Retentlon 
Time (min.) 


Acetyl-Arg-Arg-Gln-Ser-Arg-Ala-Ala-DDX 


471.2 
(IVI + 3/3) 


8.23 


Acetyl-Leu-Arg-Arg-Gln-Ser-Arg-Ala-Ala-DOX 


509.2 
(M + 3/3) 


8.60 
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EXAMPLE 5 

Determination of times to 50% cleavage of Doxorubicin/peptidic 
substrate Conjugates by the recombinant protease domain of MTSP1 

One millimolar stock solutions were prepared for each peptidic 
5 substrate conjugate in double distilled water. Cleavage reactions were 
then performed in which 100 //M conjugate was mixed with 1 or 1 0 nM 
of the recombinantly-produced active single chain protease domain of 
MTSP1 (residue 615-855 in SEQ ID No. 2, encoded by nucleotides 1865- 
2582 in SEQ ID No. 1) in 29.2 mM Tris, pH 8.4, 29.2 mM Imidazole, 217 
0 mM NaCI. Final reaction volume was 200 jjL, These reactions were 
incubated in a water bath at 37 °C. At times ranging from 2 to 128 
minutes, 20 fjL samples were removed, and enzymatic activity was 
stopped by the addition of trifluoroacetic acid to 0.33%. The amount of 
hydrolysis in each sample was measured by reverse phase HPLC. Percent 
hydrolysis was then calculated by dividing the area under the product 
peak by the sum of the areas under substrate and product peaks. 
Percent unhydrolyzed substrate was plotted against log of reaction times, 
and the plots were fit to sigmoidal curves using Prism software from 
Graphpad Inc. (San Diego, CA) to determine times at which 50% of each 
substrate was cleaved. 

Results for certain of the conjugates provided herein are shown in 
Figure 1 (conditions: 1 nM MTSP1 with 100 )t/M conjugate at 37 °C in 
12 mM tns(hydroxymethyl)aminomethane, pH 8.0, 25 mM NaCI, 0.5 mM 
CaCIz; reactions were quenched with 0.33% trifluoroacetic acid). 

EXAMPLE 6 

//? v/tro assay of cytotoxicity of Conjugates 

The cytotoxicity of the conjugates also can be tested to confirm 
that the conjugates act as prodrugs. The conjugates are tested against a 
line of cells, which is known to be killed by unmodified cytotoxic agent, 
using an Alamar Blue assay. Cells, such as LNCaP cells (The American 
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Type Culture Collection (Rockville, Maryland)), that express a cell surface 
protease, such as MTSP1 or endotheliase, are seeded in 96 well plates at 
a density of 1 x 104 cells/well (0.1 mL/well). A plate containing medium 
alone is used as a control. The cells are incubated for 3 days at 37 °C 
5 and 20 jjL of Alamar Blue is added to the assay well(s). After 7 h of 
incubation, cell killing is measured using an EL-310 plate reader at 570 
and 600 nm. Values for cell killing are expressed as the percentage 
reduction in cell numbers relative to the media controls. 

EXAMPLE 7 

10 in vivo efficacy of Conjugates 

Tumor cells are trypsinized, resuspended in the growth medium and 
centrifuged for 6 min at 200xg. The cells are resuspended in serum-free 
cr-MEM and counted. The appropriate volume of this solution containing 
the desired number of cells is then transferred to a conical centrifuge 

15 tube, centrifuged as before and resuspended in the appropriate volume of 
a cold 1:1 mixture of a-MEM-Matrigel. The suspension is kept on ice 
until the animals are inoculated. 

Male nude mice 10 weeks of age are used. Mice are individually 
weighed and assigned to groups (n = 10 per group) with no more than a 

20 2-gram difference in weight between individual mice within each group. 
On day 1, mice are inoculated subcutaneously with the tumor cell line, 
ach mouse is inoculated with, for example, 0.5 mL of 0.5 x 10® to 10^ 
tumor cells/mL in a 60% solution of ice-cold Matrigel and a-MEM. Then, 
24 h later, conjugate administration began. Vehicle-treated mice are 

25 injected with 5% dextrose in water. At the end of a predetermined time, 
such as 1 8 days to two months or more, the mice are sacrificed, and 
tumor size and mass or other parameters are measured. Tumor size and 
mass or the other parameters for conjugate-treated mice are compared to 
vehicle-treated mice to determine efficacy of the conjugate. 



wo 02/095007 



PCT/US02/16819 



-199- 

Following inoculation with the tumor cells the mice are treated 
under one of three protocols: 
Protocol A 

One day after cell inoculation the animals are dosed with 1 to 100, 
5 or 3 to 50, or 5 to 25, or 7 to 22 //mol/kg, including 7.2 or 17.9 /ymol/kg, 
of test conjugate, unmodified cytotoxic agent or vehicle control (sterile 
water). Dosages of the conjugate and cytotoxic agent are initially the 
maximum non-lethal amount, but can be subsequently titrated lower. 
Identical doses are administered at 24 hour intervals for 5 days. At the 
10 end of 5.5 weeks or other suitable interval, the mice are sacrificed and 
weights of any tumors present are measured. The animals' weights are 
determined at the beginning and end of the assay. 
Protocol B 

At 14-15 days after cell inoculation, the animals are dosed with 1 
15 to 100, or 3 to 50, or 5 to 25, or 7 to 22 //mol/kg, including 7.2 or 17.9 
/;mol/kg, of test conjugate, unmodified cytotoxic agent, or vehicle control 
(sterile water). Dosages of the conjugate and cytotoxic agent are initially 
the maximum non-lethal amount, but can be subsequently titrated lower. 
Identical doses are administered at 24 hour intervals for 5 days. At the 
20 end of 5.5 weeks or other suitable interval, the mice are sacrificed and 
weights of any tumors present are measured. The animals' weights are 
determined at the beginning and end of the assay. 

Protocol C 

One day after cell inoculation, the animals are dosed by 
25 interperitoneal administration with 1 to 100, or 3 to 50, or 5 to 25, or 7 
to 22 yi/mol/kg, including 7.2 or 17.9 //mol/kg, of test conjugate, 
unmodified cytotoxic agent, or vehicle control (sterile water). Dosages of 
the conjugate and cytotoxic agent are initially the maximum non-lethal 
amount, but can be subsequently titrated lower. Identical doses are 
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administered at 7 day intervals for 5 weeks. At the end of 5.5 weeks or 

other suitable interval, the mice are sacrificed and weights of any tumors 

present are measured. The animals' weights are determined at the 

beginning and end of the assay. 

5 EXAMPLE 8 

Gene expression profiles of exemplary MTSPs and Domain organization 

Gene expression profile of MTSP1 in normal tissues, cancer cells 
and cancer tissues 

To obtain information regarding the tissue distribution and gene 
10 expression level of MTSP1, the DNA insert from a Pichia pastoris 

expression vector, pPIC9K-MTSP1 , containing the encoding nucleic acid, 
was used to probe a blot containing RNA from 76 different human tissues 
(catalog number 7775-1; human multiple tissue expression (MTE) array; 
CLONTECH, Palo Alto, CA). Significant expression was observed in the 

15 colon (ascending, transverse and descending), rectum, trachea, 

esophagus and duodenum. Moderate expression levels were observed in 
the jejunum, ileum, ilocecum, stomach, prostate, pituitary gland, 
appendix, kidney, lung, placenta, pancreas, thyroid gland, salivary gland, 
mammary gland, fetal kidney, and fetal lung. Lower expression levels 

20 were seen in the spleen, thymus, peripheral blood leukocyte, lymph node, 
bone marrow, bladder, uterus, liver, adrenal gland, fetal heart, fetal liver, 
fetal spleen, and fetal thymus. A significant amount of the MTSPl 
transcript was also detected in colorectal adenocarcinoma cell line 
(SW480), Burkitt's lymphoma cell line (Daudi), and leukemia cell line (HL- 

25 60). RT-PCR of the MTSP1 transcript in several human primary tumors 
xenografted in athymic nude mice was performed using gene-specific 
primers. A high level of MTSP1 transcript was detected in colon 
adenocarcinoma (CX-1) and pancreatic adenocarcinoma (GI-103). 
Moderate levels were observed in another colon adenocarcinoma 

30 (GI-112), ovarian carcinoma (GI-102), lung carcinoma (LX-1), and breast 



wo 02/095007 



PCT/US02/16819 



-201- 

carcinoma (GI-101). Another lung carcinoma (GI-117) expressed a low 

level of the MTSP1 transcript. A similar RT-PCR was performed to detect 

the presence of the MTSP1 transcript in PC-3 and LNCaP cell lines. Both 

cell lines expressed significant amounts of MTSP1 transcript. MTSPI 

5 also is a marker for ovarian cancer. 

Gene expression profile of the serine protease MTSP3 in normal 
and tumor tissues 

To obtain information regarding the tissue distribution of the 

10 MTSP3 transcripts, the DNA insert encoding the MTSP3 protease domain 
was used to probe a RNA blot composed of 76 different human tissues 
(catalog number 7775-1; human multiple tissue expression (MTE) array; 
CLONTECH, Palo Alto, CA). The expression pattern observed in 
decreasing signal level was: trachea = colon (descending) = esophagus 

15 > colon (ascending) > colon (transverse) = rectum > ileum > 

duodenum > jejunum > bladder > ilocecum > stomach > kidney > 
appendix. It also is expressed less abundantly in fetal kidney, and in two 
tumor cell lines, HeLa S3 and leukemia, K-562. Northern analysis using 
RNA blots (catalog numbers 7780-1, 7765-1 & 7782-1; human 12-lane, 

20 human muscle and human digestive system multiple tissue northern 
(MTN) blots; CLONTECH) confirmed that the expression was detected 
most abundantly in the colon, moderately in the esophagus, small 
intestine, bladder and kidney, and less abundantly in stomach and 
rectum. A single transcript of ^2.2 kb was detected. 

25 Amplification of the MTSP3 transcript in several human primary 

tumors xenografted in mouse was performed using gene-specific primers. 
The MTSP3 transcript was detected in lung carcinoma (LX-1), colon 
adenocarcinoma (CX-1), colon adenocarcinoma (GI-112) and ovarian 
carcinoma {GI-102). No apparent signal was detected in another form of 
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lung carcinoma (GI-117), breast carcinoma (GI-101), pancreatic 
adenocarcinoma (GI-103) and prostatic adenocarcinoma (PCS). 

Gene expression profile of MTSP4 in normal and tumor tissues 
To obtain information regarding the gene expression profile of the 
5 MTSP4 transcript, a DNA fragment encoding part of the LDL receptor 
domain and the protease domain was used to probe an RNA blot 
composed of 76 different human tissues (catalog number 7775-1; human 
multiple tissue expression (MTE) array; CLONTECH}. As in the northern 
analysis of gel blot, a very strong signal was observed in the liver. 

10 Signals in other tissues were observed in (decreasing signal level): fetal 
liver > heart = kidney = adrenal gland = testis = fetal heart and kidney 
= skeletal muscle = bladder = placenta > brain = spinal cord = colon 
= stomach = spleen = lymph node = bone marrow = trachea = uterus 
= pancreas - salivary gland = mammary gland = lung. MTSP4 also is 

15 expressed less abundantly in several tumor cell lines including HeLa S3 = 
leukemia K-562 = Burkitt's lymphomas (Raji and Daudi) = colorectal 
adenocarcinoma (SW480) > lung carcinoma {A549) = leukemia MOLT-4 
= leukemia HL-60. PGR of the MTSP4 transcript from cDNA libraries 
made from several human primary tumors xenografted in nude mice 

20 (human tumor multiple tissue cDNA panel, catalog number K1 522-1, 

CLONTECH) was performed using MTSP4-speciflc primers. The MTSP4 
transcript was detected in breast carcinoma (GI-101), lung carcinoma 
(LX-1), colon adenocarcinoma (GI-1 12) and pancreatic adenocarcinoma 
(GI-103). No apparent signal was detected in another form of lung 

25 carcinoma (GI-1 17), colon adenocarcinoma (CX-1), ovarian carcinoma 
(GI-1 02). and prostatic adenocarcinoma (PCS). The MTSP4 transcript 
was also detected in LNCaP and PC-3 prostate cancer cell lines as well as 
in HT-1080 human fibrosarcoma cell line. 
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Gene expression profile of MTSP6 in normal and tumor tissues 
To obtain information regarding the gene expression profile of the 
MTSP6 transcript, a 495 bp DNA fragment obtained from PGR reaction 
with primers Ch17-NSP-3 and NSP-4AS was used to probe an RNA blot 
5 composed of 76 different human tissues (catalog number 7775-1; human 
multiple tissue expression (MTE) array; CLONTECH). The strongest signal 
was observed In duodenum. Signals in other tissues were observed in 
(decreased signal level): Stomach > trachea = mammary gland = 
thyroid giand = salivary gland = pituitary gland = pancreas > kidney > 

10 lung > jejunum = ileum = ilocecum = appendix = fetal kidney > fetal 
lung. Very weak signals also can be detected in several other tissues. 
MTSP6 also is expressed in several tumor cell lines including HeLa S3 > 
colorectal adenocarcinoma (SW480) > leukemia MOLT-4 > leukemia 
K-562. PGR analysis of the MTSP6 transcript from cDNA libraries made 

15 from several human primary tumors xenografted in nude mice (human 

tumor multiple tissue cDNA panel, catalog number K1 522-1, CLONTECH) 
was performed using MTSP6-specific primers (Gh17-NSP-3 and 
Gh17-NSP2AS). The MTSP6 transcript was strongly detected in lung 
carcinoma (LX-1), moderately detected in pancreatic adenocarcinoma 

20 (GI-103), weakly detected in ovarian carcinoma (GI-102); and very 
weakly detected in colon adenocarcinoma (GI-1 12 and CX-1), breast 
carcinoma (GI-1 01), lung carcinoma (Gl-117) and prostatic 
adenocarcinoma (PC3). The MTSP6 transcript was also detected in 
breast cancer cell line MDA-MB-231, prostate cancer cell line PC-3, but 

25 not in HT-1080 human fibrosarcoma cell line. MTSP6 also is expressed 

in mammary gland carcinoma cDNA (Clontech). 

Gene expression profile of MTSP9 in normal, tumor tissues and 
cell lines 

To obtain a gene expression profile of the iVlTSPS transcript, the 
30 MTSP9 cDNA fragment obtained from human pancreas was used to 
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probe a dot blot composed of RNA extracted from 76 different human 
tissues {Human Multiple Tissue Expression (MTE) Array; Clontech, Palo 
Alto, CA; catalog no. 7775-1). The results of this analysis indicate that 
MTSP9 is highly expressed in the esophagus and expressed at a low level 
5 in many other tissues. The MTSP9 transcript is found in kidney {adult 
and fetal), spleen (adult and fetal), placenta, liver (adult and fetal), 
thymus, peripher-al blood leukocyte, lung (adult and fetal), pancreas, 
lymph node, bone marrow, trachea, uterus, prostate, esophagus, testes, 
ovary and the gland organs {mammary, adrenal, thyroid, pituitary and 

10 salivary). MTSP9 also is expressed in tumor esophagus tissues, in a lung 
carcinoma (A549 cell line) and, at a low level, in a colorectal carcinoma 
(SW480), lymphoma (Raji and Daudi), a cervical carcinoma (HeLaS3) and 
leukemia (HL-60, K-562 and MOLT-4) cell lines. 

Gene expression profile of MTSP1 0 In normal and tumor tissues 

15 To obtain information regarding the gene expression profile of the 

MTSP10 transcript, PGR analysis was carried out on cDNA panels made 
from several human adult tissues {Clontech, Cat. #K 1420-1) cDNA panel 
using MTSP1 0-specific primers. MTSP10 transcript was detected in 
pancreas, lung and kidney. MTSP10 transcript was also detected in small 

20 intestine Marathon-Ready cDNA (Clontech). PCR of the MTSPIO 

transcript from cDNA libraries made from several human primary tumors 
xenografted in nude mice (human tumor multiple tissue cDNA panel, 
catalog number K 1522-1, CLONTECH) was also performed. The 
MTSPIO transcript was detected in breast carcinoma (GI-IOI), lung 

25 carcinoma {LX-1 and Gl-1 17), ovarian carcinoma (GI-102), and pancreatic 
adenocarcinoma (Gl-1 03). The MTSPIO transcript can be weakly 
detected in prostatic adenocarcinoma (PCS). No apparent signal was 
detected in two forms of colon adenocarcinomas (Gl-1 12 and CX-1). The 
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MTSP10 transcript was also detected in CWR22R prostate tumor grown 
on nude mice. 

Domain organization and gene expression profile of MTSP12 in 
normal and tumor tissues 

5 Domain organization of MTSP12PD1, -PD2 and -PD3 and 

homology to other serine proteases 

Sequence and protein domain analyses of the translated 

MTSP12PD1, -PD2 and -PD3 nucleotide sequences indicate that these 

three serine proteases are contiguous. The sequence order is as follows: 

10 MTSP12-PD1 is found at the N terminus followed by MTSP12-PD2, and 
MTSP12-PD3 is at the C terminus. MTSP12-PD1 and -PD2 contain a 
trypsin-Iike serine protease domain (aa 236 to aa 465 and aa 537 to aa 
765 for I\/1TSP12-PD1 and -PD2, respectively) characterized by the 
presence of a protease activation cleavage site (...R236 4 1237VGGMEAS..,, 

15 and ... R537 IVsggVGGFGAA..., for l\/ITSP12-PD1 and -PD2, respectively, 
and where | indicates a protease activation cleavage site) and the 
catalytic triad residues (Hisj??, ASP326 and Ser42i in MTSP12-PD1; HiSgyg/ 
Asp626 anc* Ser72i in MT5P12-PD2) in 3 highly-conserved regions of the 
catalytic domain. MTSP12-PD3 contains a serine protease domain (aa 

20 861 to aa 1087); it has a protease activation cleavage site 

{...Rbso* 'seiVGGSAAG...) and has the catalytic Hisgojand Aspg49, but it 
has a Alaio43 instead of the conserved catalytic serine found in serine 
proteases. Several domains are found upstream of the MTSP12-PD1 
serine protease domain and these include a transmembrane domain {aa 

25 28 to aa 50), a SEA (sea urchin sperm protein-enterokinase-agrin) domain 
(aa 51 to aa 170) and an LDLa Qow density lipoprotein receptor class a) 
domain {aa 187 to aa 225). There are 5 possible A/-linked glycosylation 
sites {NiigSS, NgeiHT, Ng^aAT, NggyPS and NgsoST), In the protease 
domain of MTSP12-PD1, there is an unpaired cysteine {C34g) in a single 

30 chain form of the protease domain and the following Cys pairings are 
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noted: C262"C278/ C36o"^427' 7*^446' ^392'^406- '"^ protsase domain of 
MTSP12-PD2, there is an unpaired cysteine {Ce46) in a single chain form 
of the protease domain, and the following Cys pairings are noted: C5Q3- 
^579/' Cgeo'Cya?; Ce92-C706; C717-C746. In the protease domain of MTSP12- 
5 PD3, there is an unpaired cysteine (Cggg) in a single chain form of the 
protease domain, and the following Cys pairings are noted: C^sy-C^Qj; 

^983~^1049' ^1014"Cl028' ^t039~^1068- 

Alignment {blastp; http://www.ncbi.nlm.nih.gov/Bl_AST) of the 
respective l\/ITSP12-PD1 , MTSP1 2-PD2 and MTSP12-PD3 protein 

10 sequences to known serine proteases deposited in the public database 
showed a 45%, 45% and 48% identity to matriptase, a 44%, 43% and 
41% identity with DESCI/endotheliase 1, a 44%, 43% and 48% identity 
to prostamin {AB030036), a 43%, 39% and 39% identity to spinesin 
(TMPRSS5; NM_030770), and a 40%, 38% and 38% identity to 

15 marapsin (NM_031 948). The clone has about 93% homology at the 
nucleotide and encoded protein levels to a clone and encoded provided 
described in International PCT application No. WO 02/00860 {see SEQ ID 
Nos. 38 and 97 therein). The encoded protein described in the PCT 
application, however, includes the Sequence set forth in SEQ ID No. 271 

20 between amino acids Leu373 and Val374 of SEQ ID No. 20, as well as 
an additional extended sequence of amino acids beteween amino acids 
Ala48 and Phe49 of SEQ ID No. 20 and lacks amino acids 91-124 of SEQ 
ID No. 20. The protein provided in Internationa! PCT application 
N0.WOO2/OO86O can be used in the methods provided herein. 

25 Gene and Tissue expression profile of MTSP12 

To obtain information regarding the tissue distribution profile of the 
MTSP12PD1, -PD2 and -PD3 transcripts, 3 cDNA probes were prepared. 
Data indicate that the MTSP1 2PD1 , -PD2 and -PD3 transcript is 
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expressed at a low level in most of the 76 tissues and cell lines, but at a 
higher level in the lymph node and testes. 

To compare the expression profile of MTSP12PD1, -PD2 and -PD3 
in a range of normal human and matched tumor tissues, a matched 
5 tumor/normal expression array (catalog number 7840-1; 

http://www.clontech.com) composed of 68 paired cDNA samples from 
individual patients was used. Results show that the MTSP12PD1, -PD2 
and -PD3 transcript is expressed at a low level in a number of normal 
tissues including breast, uterus, colon, ovary, lung, kidney and rectum, 

10 but is not differentially expressed in any of the matched tumors. It also is 
expressed at a low level in several tumor cell lines, including HeLa 
(cervical carcinoma), Daudi (Burkitt's lymphoma), K562 (chronic 
myelogenous leukemia), HL-60 (premyelocytic leukemia), G361 
(melanoma), A549 (lung carcinoma), MOLT-4 (lymphoblastic leukemia), 

15 SW480 (colorectal adenocarcinoma), and Raji (Burkitt's lymphoma). 

Several SMART™ 5'-RACE cDNA libraries (catalog number K1811- 
1; http://www.clontech.com) prepared from normal breast, normal testes, 
normal prostate, prostate cancer cell lines and breast cancer cell lines 
were analyzed for the presence of MTSP12PD1, -PD2 and -PD3 transcript 

20 by RT-PCR using two sets of gene-specific primers. The MTSP12-PD2 
and -PD3 transcript was detected in normal prostate, PC-3, LNCaP, 
normal breast, MDA-MB-231, MDA-MB-361 , MDA-l\/IB-453 and DU4475, 
but higher levels were observed in normal breast and MDA-MB-231 . The 
MTSP12-PD1 transcript was detected in the same tissues and cell lines, 

25 except higher levels were observed in normal breast, MDA-MB-231 and 
DU4475. 
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Gene expression profile of MTSP20 in normal, tumor tissues and 
cell lines 

To obtain information regarding the gene expression profile of the 
MTSP20 transcript, the MTSP20 cDNA fragment obtained from human 
5 lung tissue was used to probe a dot blot composed of RNA extracted 
from 76 different human tissues (Human Multiple Tissue Expression 
(MTE) Array; Clontech, Palo Alto, CA; catalog no. 7775-1). The results 
indicate that RNA encoding MTSP20 is expressed in a variety of tissues. 
The MTSP20 transcript is found in liver, lymph node, cerebellum, 

10 pancreas, prostate, uterus, testis, glands (adrenal, thyroid and salivary), 
thymus, kidney and spleen. Lower transcript level can be found in lung, 
placenta, bladder, ovary, digestive system, circulatory system and other 
parts of the the brain. MTSP20 is also expressed in certain tumor cell 
lines including lung carcinoma (A519), colorectal carcinoma (SW480), 

15 lymphoma (Raji and Daudi), cervical carcinoma (HeLaS3) and leukemia 

(HL-60, K-562 and MOLT-4) cell lines. 

Gene expression profile of MTSP22 In normal, tumor tissues and 
cell lines 

MTSP22 is expressed in the uterine tissue, thymus, adipose tissue, 
20 and lymph node. It may also be expressed in lung, stomach, uterine, 
breast, ovarian, prostate and in other tumors. To obtain information 
regarding the gene expression profile of the MTSP22 transcript, the cDNA 
fragment encoding the entire serine protease domain was used to probe a 
dot blot composed of RNA extracted from 72 different human tissues 
25 (Human Multiple Tissue Expression (MTE) Array; Clontech, Palo Alto, CA; 
catalog no. 7776-1) as well as a dot blot composed of normalized cDNA 
from 241 tumor and corresponding normal tissues from individual patients 
(Cancer Profiling Array, Clontech, catalog no. 7841-1). The results of 
MTE analysis indicated that MTSP22 transcript is expressed primarily in 
30 the esophagus. In the cancer profiling array analysis, MTSP22 is 
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highly expressed in 3 of the 42 normal uterus tissue samples, but not in 
their matched tumor samples. In one of the 42 uterus samples, MTSP22 
is expressed in tumor and its metastatic tissues, but not in the normal 
tissue counterpart. MTSP22 is also expressed in 2 of the 17 stomach 
5 tumors and 2 of the 21 lung tumors, but not in their normal tissue 

counterparts. MTSP22 is also expressed In the normal tissue of the only 
pancreas matched cDNA pair. PCR analysis was also performed using 
commercially available cDNA panel from several human adult tissues 
{Clontech, Cat. #K1420-1 and K1420-2) and primary tumors (Clontech 
10 Cat. #K1 522-1) as well as several Marathon-Ready cDNAs (Clontech). 

I\/ITSP22 cDNA was detected in thymus, adipose tissue, and lymph 
node. Serine protease domain of MTSP22 and homology to other 
proteases. 

Sequence analysis of the translated MTSP22 protease domain 

15 sequence revealed that MTSP22 contains a trypsin-like serine protease 

domain characterized by the presence of a protease activation cleavage 

site at the amino terminus of the domain and the catalytic triad residues 

(histidine, aspartate and serine) in three highly-conserved regions. 

Alignment of the protein sequence with that of endotheliase 1 (same as 

20 serine protease DESC1 protein; GenBank accession number AF064819) 

indicated that the two proteins share 50% sequence identity in their 

protease domains. 

Gene expression profile of MTSP25 in normal, tumor tissues and 
cell lines 

25 MTSP25 is expressed in breast, colon, uterine, ovarian, kidney, 

prostate, testicular cancer tissue. It may also be expressed in lung, 
stomach, prostate and in other tumors. To obtain information regarding 
the gene expression profile of the MTSP25 transcript, a 369 bp DNA 
fragment containing MTSP25 protease domain sequence (obtained from a 

30 PCR reaction) was used to probe a dot blot composed of RNA extracted 
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from 72 different human tissues (Human Multiple Tissue Expression 
(MTE) Array; Ciontech, Palo Alto, CA; catalog no. 7776-1) as well as a 
dot blot composed of normalized cDNA from 241 tumor and 
corresponding normal tissues from individual patients (Cancer Profiling 
5 Array, Ciontech, catalog no. 7841-1). The results of MTE analysis 

indicate that MTSP25 transcript is expressed weakly in the lymph node. 
In the cancer profiling array analysis, MTSP25 is highly expressed in all 4 
prostate samples (in normal and cancer samples). In one of the 20 
kidney cDNA pairs, MTSP25 is highly expressed in the tumor sample, 
10 but not in its normal tissue counterpart. MTSP25 is also expressed in 1 
of the 50 breast cancer samples, but not in its normal tissue counterpart. 

MTSP25 is also expressed in 3 of the 42 normal uterus samples, 
but not in their tumor counterparts. MTSP25 expression is also detected 
in 3 of the 14 ovarian cancer samples. Among these three samples, the 
15 expression of MTSP25 was also detected in one of the matched normal 
tissue counterparts. MTSP25 expression was also detected in 5 tumor 
samples in the 34 colon cDNA pairs. 

PCR analysis was also performed using a commercially available 
cDNA panel from several human adult tissues (Ciontech, Cat. #K1 420-1 
20 and Kl 420-2) as well as several Marathon-Ready cDNAs (Ciontech). 
MTSP25 cDNA was strongly detected in testis and mammary gland 
adenocarcinoma, weakly detected in brain, placenta, lung, spleen, 
prostate, small intestine, colon, and leukocyte, and very weakly detected 
in heart, liver, and pancreas. 
25 EXAMPLE 9 

Conjugates that have been prepared according to the procedures of 
Examples 1-4 by routine and minor modification of the procedures, such 
as using different Fmoc-amino acid building blocks, include: 
Ac-R-Q-G-R-S-L-(Dox) (SEQ ID NO: 491); 
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Ac-R-Q-G-R-S-S-L-(Dox) (SEQ iD NO: 492); 

Ac-R-Q-G-R-S-nL-(Dox) (SEQ !D NO: 493); 

Ac-R-Q-G-R-S-nV-(Dox) (SEQ ID NO: 494); 

Ac-R-Q-G-R-S-F-(Dox) (SEQ ID NO: 495); 
5 Ac-R-Q-G-R-A-L-(Dox) (SEQ ID NO: 496); 

Ac-R-Q-G-R-A-L-(Dox) (SEQ ID NO: 497); 

Ac-R-Q-G-R-A-nL-(Dox) (SEQ ID NO: 498); 

Ac-R-Q-G-R-A-nL-(Dox) (SEQ ID NO: 499); 

Ac-R-Q-G-R-A-nV-(Dox) (SEQ ID NO: 500); 
10 Ac-R-Q-G-R-A-Cha-(Dox) (SEQ ID NO: 501); 

Ac-R-Q-G-R-A-F-(Dox) (SEQ ID NO: 502); 

Ac-R-N-G-R-S-L-(Dox) (SEQ ID NO: 503); 

Ac-R-N-G-R-A-nL-(Dox) (SEQ ID NO: 504); 

Ac-R-Q-A-R-S-L-(Dox) (SEQ ID NO: 505); 
15 Ac-R-Q-A-R-S-nL-(Dox) (SEQ ID NO: 506); 

Ac-R-Q-A-R-S-nV-(Dox) (SEQ ID NO: 507); 
Ac-R-Q-A-A-S-Cha-{Dox) (SEQ ID NO: 508); 
Ac-R-Q-A-R-S-S-Cha-(Dox) (SEQ ID NO: 509); 
Ac-R-Q-A-R-T-nL-(Dox) (SEQ ID NO: 510); 
20 Ac-R-Q-A-R-A-U(Dox) (SEQ ID NO: 511); 
Ac-R-Q-A-R-A-nL-(Dox) (SEQ ID NO: 513); 
Ac-R-Q-A-R-A-nV-(Dox) (SEQ ID NO: 514); 
Ac-R-Q-A-R-A-Cha-(Dox) (SEQ ID NO: 515); 
Ac-R-Q-S-R-A-A-(Dox) (SEQ ID NO: 516); 
25 Ac-R-Q-S-R-A-(Dox) (SEQ ID NO: 517); 
Ac-R-Q-S-R-A-nL-(Dox) (SEQ ID NO: 518); 
Ac-R-Q-S-R-A-L-{Dox) (SEQ ID NO: 519); 
Ac-R-Q-S-R-A-nV-(Dox) (SEQ ID NO: 520); 
Ac-R-Q-S-R-A-Cha-(Dox) (SEQ ID NO: 521); 
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Ac-R-Q-S-R-S-S-L-(Dox) (SEQ ID NO: 522); 

Ac-R-Q-S-R-S-L-(Dox) (SEQ ID NO: 523); 

Ac-R-Q-S-R-S-dnL-(Dox) (SEQ ID NO: 524); 

Ac-R-Q-S-R-S-nL-(Dox) (SEQ ID NO: 525); 
5 Ac-R-Q-S-R-S-nV-(Dox) (SEQ ID NO: 526); 

Ac-R-Q-S-R-S-allylG-(Dox) (SEQ ID NO: 527); 

Ac-R-a-S-R-S-Cha-(Dox) (SEQ ID NO: 528); 

Ac-R-Q-S-R-T-nL-(Dox) (SEQ ID NO: 529); 

Ac-R-Q-T-R-S-S-L-(Dox) (SEQ ID NO: 530); 
10 Ac-R-Q-T-R-S-L-(Dox) (SEQ ID NO: 531); 

Ac-R-N-S-R-S-nL-(Dox) (SEQ ID NO: 532); 

Ac-R-Q-F-R-S-L-(Dox) (SEQ ID NO: 533); 

Ac-R-Q-F-R-S-nL-(Dox) (SEQ ID NO: 534); 
Ac-R-Q-F-R-S-nV-(Dox) (SEQ ID NO: 535); 
15 Ac-R-a-F-R-S-nL-(Dox) (SEQ ID NO: 536); 
Ac-R-Q-F-R-S-Cha-(Dox) (SEQ ID NO: 537); 
Ac-R-Q-F-R-A-L-(Dox) (SEQ ID NO: 538); 
Ac-R-Q-F-R-A-nL-(Dox) (SEQ ID NO: 539); 
Ac-R-Q-F-R-A-nV-(Dox) (SEQ ID NO: 540); 
20 Ac-R-Q-F-R-A-Cha-(Dox) (SEQ ID NO: 541); 
Ac-Q-S-R-S-S-nL-(Dox) (SEQ ID NO: 542); 
MeOCO-Quat2-G-R-S-L-NH2 (SEQ ID NO: 483); 
MeOCO-Quat3-G-R-S-L-NH2 (SEQ ID NO: 484); 
MeOCO-Quat-G-R-S-L-NH2 (SEQ ID NO: 485); 
25 MeOCO-Quat4-G-R-S-L-NH2 (SEQ ID NO: 486); 
MeOCO-Quat5-G-R-S-L-NH2 (SEQ ID NO: 487); 
MeOCO-Quat2-G-R-S-S-L-NH2 (SEQ ID NO: 488); 
MeOCO-Quat4-G-R-S-L-(Dox) (SEQ ID NO: 489); 
MeOCO-Quat2-G-R-S-L-(Dox) (SEQ ID NO: 490); 
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Ac-Q-G-R-S-L-(Dox) (SEQ ID NO: 445); 

Ac-Q-G-R-S-S-L-(Dox) (SEQ ID NO; 446); 

Ac-a-G-R-A-S-L-(Dox) (SEQ ID NO: 447); 

Ac-N-G-R-S-S-L-(Dox) (SEQ ID NO: 448); 
5 Ac-Q-G-R-S-S-nL-(Dox) (SEQ ID NO: 449); 

Ac-Q-G-R-S-S-nV-(Dox) (SEQ ID NO: 450); 

Ac-Q-G-R-S-S-Cha-(Dox) (SEQ ID NO: 451); 

Ac-Q-G-R-S-S-allylG-(Dox) (SEQ ID NO: 452); 

Ac-Q-G-R-S-S-allylG-(Dox) (SEQ ID NO: 453); 
10 Ac-Q-A-R-S-L-(Dox) (SEQ ID NO: 454); 

Ac-Q-A-R-S-S-L-(Dox) (SEQ ID NO: 455); 

Ac-Q-S-R-S-L-(Dox) (SEQ ID NO: 456); 

Ac-Q-S-R-S-S-nV-(Dox) (SEQ ID NO: 457); 

Ac-Q-S-R-S-S-Cha-(Dox) (SEQ ID NO: 458); 
15 Ac-Q-S-R-S-S-L-(Dox) (SEQ ID NO: 459); 

Ac-Q-T-R-S-S-L-(Dox) (SEQ ID NO: 460); 

Ac-Q-Aib-R-S-S-Cha-(Dox) (SEQ ID NO: 461); 

Ac-Q-Aib -R-S-S-L-(Dox) (SEQ ID NO: 462); 

Ac-Q-Abu-R-S-S-Cha-(Dox) (SEQ ID NO: 463); 
20 Ac-Q-Abu-R-S-S-L-{Dox) (SEQ ID NO: 464); 

Ac-Q-Cha-R-S-S-Cha-(Dox) (SEQ ID NO: 465); 

Ac-Q-F-R-S-L-(Dox) (SEQ ID NO: 466); 

Ac-Q-F-R-S-S-L-(Dox) (SEQ ID NO: 467); 

Ac-Q-Y-R-S-S-L-(Dox) (SEQ ID NO: 468); 
25 Ac-R-G-R-S-L-(Dox) (SEQ ID NO: 469); 

Ac-R-G-R-S-S-L-(Dox) (SEQ ID NO: 470); 

Ac-R-G-R-S-S-Cha-{Dox) (SEQ ID NO: 471); 
Ac-R-G-R-S-Cha-(Dox) (SEQ ID NO: 472); 
Ac-R-A-R-S-L-(Dox) (SEQ ID NO: 473); 
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Ac-R-A-R-S-S-L-{Dox) (SEQ ID NO: 474); 

Ac-R-S-R-S-L-lDox) (SEQ ID NO: 475); 

Ac-R-S-R-S-S-L-{Dox) (SEQ ID NO: 476); 

Ac-R-S-R-S-Cha-(Dox) (SEQ ID No': 477); 
5 Ac-R-S-R-S-S-Cha-(Dox) (SEQ ID NO: 478); 

Ac-R-F-R-S-L-(Dox) (SEQ ID NO: 479); 

Ac-R-F-R-S-Cha-(Dox) (SEQ ID NO: 480); 

Ac-Y-G-R-S-S-L-(Dox) (SEQ ID NO: 481); 

Ac-M(02)-S-R-S-L-(Dox) (SEQ ID NO: 482); 
10 Ac-R-R-Q-S-R-A-A-(Dox) (SEQ ID NO: 105); 

Ac-R-R-Q-S-R-l-(Dox) (SEQ ID NO: 610); 

Ac-R-R-Q-S-R-S-S-L-{Dox) (SEQ ID NO: 543); 

Ac-R-R-Q-S-R-S-L-{Dox) (SEQ ID NO: 544); 

Ac-R-G-S-G-R-S-L-(Dox) (SEQ ID NO: 545); 
15 Ac-R-G-S-G-R-S-nL-(Dox) (SEQ ID NO: 546); 

Ac-R-G-S-G-R-A-nL-{Dox) (SEQ ID NO: 547); 

Ac-R-G-S-G-R-S-S-L-(Dox) (SEQ ID NO: 548); 

Ac-l-V-S-G-R-A-S-L-(Dox) (SEQ ID NO: 549); 

Ac-R-R-Q-S-R-A-{Dox) (SEQ ID NO: 108); 
20 Ac-R-R-Q-S-R-l-(Dox) (SEQ ID NO: 111); 

Ac-L-R-R-Q-S-R-A-A-{Dox) (SEQ ID NO: 106); 

Ac-L-R-R-Q-S-R-G-G-(Dox) (SEQ ID NO: 109); 

Ac-L-R-R-Q-S-R-A-(Dox) (SEQ ID NO: 110); 

Ac-L-R-R-Q-S-R-A-l-(Dox) (SEQ ID NO: 112); 
25 Ac-L-R-R-Q-S-R-A-l-(Dox) (SEQ ID NO: 61 1); 

Ac-L-R-R-Q-S-R-S-S-L-(Dox) (SEQ ID NO: 550); 

Ac-L-R-R-Q-S-R-S-L-(Dox) (SEQ ID NO: 551); 

Ac-S-G-R-S-L-(Dox) (SEQ ID NO: 362); 

Ac-S-G-R-S-S-L-(Dox) (SEQ ID NO: 363); 
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Ac-S-G-R-S-S-S-L-{Dox) (SEQ ID NO: 364); 

Ac-S-G-R-S-nL-(Dox) {SEQ ID NO: 365); 

Ac-S-G-R-S-nV-{Dox) (SEQ ID NO: 366); isomer 1 

Ac-S-G-R-S-nV-{Dox) (SEC ID NO: 3671; isomer 2 
5 Ac-S-G-R-S-G(hex)-(Dox) (SEQ ID NO: 368); 

Ac-S-G-R-S-Cha-(Dox) {SEQ ID NO: 369); 

Ac-S-G-R-S-hCha-{Dox) {SEQ ID NO: 370); 

Ac-S-A-R-S-L-{Dox) {SEQ ID NO: 371); 

Ac-S-A-R-S-S-L-{Dox) {SEQ ID NO: 372); 
10 Ac-S-S-R-S-nL-(Dox) (SEQ ID NO: 373); 

Ac-T-G-R-S-Abu-(Dox) (SEQ ID NO: 374); 

Ac-T-G-R-S-L-(Dox) (SEQ ID NO: 375); 

Ac-T-G-R-S-nV-(Dox) (SEQ ID NO: 376); 

Ac-T-G-R-S-nL-(Dox) (SEQ ID NO: 377); 
15 Ac-T-G-R-S-G(hex)-(Dox) (SEQ ID NO: 378); 
Ac-T-G-R-S-Cha-(Dox) (SEQ ID NO: 379); 
Ac-T-G-R-S-hCha-(Dox) (SEQ ID NO: 380); 
Ac-T-G-R-T-Abu-{Dox) (SEQ ID NO: 381); 
Ac-T-G-R-hS-nL-(Dox) (SEQ ID NO: 382); 
20 Ac-T-G-R-Abu-nL-{Dox) (SEQ ID NO: 383); 
Ac-T-G-R-Abu-nV-{Dox) {SEQ ID NO: 384); 
Ac-T-G-F{Gn)-S-nL-(Dox) (SEQ ID NO: 385); 
Ac-T-G-F(Gn)-S-Cha-(Dox) (SEQ ID NO: 386); 
Ac-T-G-F(Gn)-Abu-nV-{Dox) (SEQ ID NO: 387); 
25 Ac-T-G-K(alloc)-S-nL-(Dox) (SEQ ID NO: 388); 
Ac-T-G-K-S-nL-(Dox) (SEQ ID NO: 389); 
Ac-T-G-hR-S-nL-(Dox) (SEQ ID NO: 390); 
Ac-(hS)G-G-R-S-nL-{Dox) (SEQ ID NO: 391); 
MeOCO-T-G-R-S-nL-{Dox) (SEQ ID NO: 392); 
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PhS02-T-G-R-S-nL-(Dox) (SEQ ID NO: 393); 

MeOEtCO-T-G-R-S-nL-(Dox) {SEQ ID NO: 394); 

MeO(EtO)2Ac-T-G-R-S-nL-(Dox) (SEQ ID NO: 395); 

4-oxo-Pentanoyl-T-G-R-S-nL-(Dox) (SEQ ID NO: 396); 
5 3,4-MethyldioxyPhAc-T-G-R-S-nL-{Dox) (SEQ ID NO: 397); 

2-PyridylAc-T-G-R-S-nL-(Dox) (SEQ ID NO: 398); 

PhOAc-T-G-R-S-nL-(Dox) (SEQ ID NO: 399); 

L-3-PhLactyl-T-G-R-S-nL-(Dox) (SEQ ID NO: 400); 

MeOAc-T-G-R-S-nL-(Dox) (SEQ ID NO: 401); 
10 PhAc-T-G-R-S-nL-(Dox) (SEQ ID NO: 402); 

MeOEtOCO-T-G-R-S-nL-(Dox) (SEQ ID NO: 403); 

MeOEtOAc-T-G-R-S-nL-(Dox) (SEQ ID NO: 404); 

HOOCButa-T-G-R-S-nL-(Dox) (SEQ ID NO: 405); 

2-T-G-R-S-nL-(Dox) (SEQ ID NO: 406); 
15 EtOCO-T-G-R-S-nL-(Dox) (SEQ ID NO: 407); 

^A-T-G-R-S-nL-(Dox) (SEQ ID NO: 408); 

Pent-4-ynoyI-T-G-R-S-nL-(Dox) (SEQ ID NO: 409); 

NapAc-T-G-R-S-nL-(Dox) (SEQ ID NO: 410); 

iBoc-T-G-R-S-nL-{Dox) (SEQ ID NO: 41 1); 
20 HOAc-T-G-R-S-nL-(Dox) (SEQ ID NO: 412); 

MeSucc-T-G-R-S-nL-(Dox) (SEQ ID NO: 413); 

N,N-dil\/leGly-T-G-R-S-nL-(Dox) (SEQ ID NO: 414); 

Succ-T-G-R-S-nL-(Dox) (SEQ ID NO: 415); 

HCO-T-G-R-S-nL-(Dox) (SEQ ID NO: 416); 
25 Ac-T-A-R-S-nL-(Dox) (SEQ ID NO: 417); 

Ac-T-A-F(Gn)-S-nL-(Dox) (SEQ ID NO: 418); 

Ac-T-A-R-Abu-nV-(Dox) (SEQ ID NO: 419); 

Ac-T-A-R-S-Abu-(Dox) (SEQ ID NO: 420); 

Ac-T-A-R-T-Abu-(Dox) (SEQ ID NO: 421); 
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Ac-T-hS-R-S-nL-(Dox) (SEQ ID NO: 423); 

Ac-T-{1-Me)H-R-S-nL-(Dox) (SEQ ID NO: 424); 

Ac-T-(3-Me)H-R-S-nL-(Dox) (SEQ ID NO: 425); 
5 Ac-T-H-R-S-nL-(Dox) (SEQ ID NO: 426); 

Ac-T-Sar-R-S-nL-(Dox) (SEQ ID NO: 427); 

Ac-T-nV-R-S-nL-(Dox) (SEQ ID NO: 428); 

Ac-T-nL-R-S-nL-(Dox) (SEQ ID NO: 429); 

Ac-T-A-R-S-Cha-(Dox) (SEQ ID NO: 430); 
10 Ac-T-Abu-R-S-nL-(Dox) (SEQ ID NO: 431); 

Ac-4,4diMeThr-G-R-S-nL-(Dox) (SEQ ID NO: 432); 

Ac-hS-G-R-S-nL-(Dox) (SEQ ID NO: 433); 

Ac-hS-G-R-hS-Cha-(Dox) (SEQ ID NO: 434); 

Ac-hS-G-R-S-Cha-(Dox) (SEQ ID NO: 435); 
15 Ac-hS-G-R-T-Cha-{Dox) (SEQ ID NO: 436); 

Ac-hS-A-R-S-Cha-(Dox) (SEQ ID NO: 437); 

Ac-N-G-R-S-nL-(Dox) (SEQ ID NO: 438); 

Ac-Y-G-R-S-S-L-(Dox) (SEQ ID NO: 439); 

Ac-Y-G-R-S-Cha-(Dox) (SEQ ID NO: 440); 
20 Ac-Q-G-R-S-S-nL-(Dox) (SEQ ID NO: 441); 

Ac-Q-G-R-S-S-nV-{Dox) (SEQ ID NO: 442); 

Ac-L-R-G-S-G-R-S-A-(Dox) (SEQ ID NO: 573); 

Ac-L-R-G-S-G-R-S-L-(Dox) (SEQ ID NO: 342); 

Ac-L-R-G-S-G-R-S-L-(Dox) (SEQ ID NO: 343); 
25 Ac-L-R-G-S-G-R-S-S-nL-(Dox) (SEQ ID NO: 344); 

Ac-L-R-G-S-G-R-S-S-Cha-(Dox) (SEQ ID NO: 345); 

Ac-L-R-G-dS-A-R-S-A-(Dox) (SEQ ID NO: 574); 

Ac-L-R-G-S-A-R-S-S-L-(Dox) (SEQ ID NO: 346); 

Ac-L-R-G-S-A-R-S-L-(Dox) (SEQ ID NO: 347); 
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Ac-L-R-G-S-A-R-S-S-Cha-{Dox) (SEQ ID NO: 348); 

Ac-L-R-G-S-A-R-S-S-nV-(Dox) (SEQ ID NO: 349); 

Ac-L-R-G-S-A-R-S-S-nL-(Dox) {SEQ ID NO: 350); 

Ac-V-l-V-S-G-R-A-L-(Dox) (SEQ ID NO: 351); 
5 Ac-V-I-V-S-A-R-S-L-(Dox) (SEQ ID NO: 352); 

Ac-V-l-V-S-G-R-S-S-L-(Dox) (SEQ ID NO: 353); 

Ac-V-I-V-S-A-R-M-A-(Dox) (SEQ ID NO: 354); 

Ac-V-l-V-S-A-R-nL-A-(Dox) (SEQ ID NO: 355); 

Ac-V-l-V-S-A-R-S-nL-{Dox) (SEQ ID NO: 356); 
10 Ac-V-l-V-S-A-R-S-Cha-(Dox) (SEQ ID NO: 357); 

Ac-V-l-V-S-A-R-S-Cha-(Dox) (SEQ ID NO: 358); 

Ac-V-l-V-S-A-R-S-S-Cha-(Dox) (SEQ ID NO: 359); 

Ac-R-R-{Me)C-P-G-R-V-V-(Dox) (SEQ ID NO: 360); 

Ac-R-R-nV-P-A-R-S-L-{Dox) (SEQ ID NO: 361); 
15 Ac-R-G-dS-A-R-S-A-(Dox) (SEQ ID NO: 309); 

Ac-R-G-S-G-R-S-A-(Dox) {SEQ ID NO: 310); 

Ac-R-G-S-G-R-A-L-(Dox) (SEQ ID NO: 31 1); 

Ac-R-G-S-G-R-S-L-{Dox) (SEQ ID NO: 312); 

Ac-R-G-S-G-R-S-nL-(Dox) (SEQ ID NO: 313); 
20 Ac-R-G-S-G-R-A-nL-(Dox) {SEQ ID NO: 314); 

Ac-R-G-S-G-R-S-S-L-(Dox) (SEQ ID NO: 315); 

Ac-R-G-S-G-R-S-Cha-(Dox) (SEQ ID NO: 316); 

Ac-R-G-S-G-R-S-S-Cha-(Dox) (SEQ ID NO: 317); 

Ac-R-G-S-A-R-S-Cha-(Dox) (SEQ ID NO: 318); 
25 Ac-R-G-S-A-R-S-S-(Dox) (SEQ ID NO: 319); 

Ac-R-G-S-A-R-S-nV-(Dox) (SEQ ID NO: 320); 

Ac-R-G-S-A-R-S-S-nV -(Dox) {SEQ ID NO: 321); 

Ac-R-G-S-A-R-S-L-{Dox) (SEQ ID NO: 322); 

Ac-R-{Me)C-P-G-R-V-V-{Dox) (SEQ ID NO: 323); 
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Ac-R-(Me)C-P-G-R-V-V-(Dox) (SEQ ID NO: 324); 

Ac-R-C{Me)-P-G-R-S-L-(Dox) (SEQ ID NO: 325); 

Ac-R-L-P-G-R-S-L-(Dox) (SEQ ID NO: 326); 

Ac-R-V-P-G-R-S-L-(Dox) (SEQ ID NO: 327); 
5 Ac-R-V-P-G-R-S-L-(Dox) (SEQ ID NO: 328); 

Ac-R-nL-P-G-R-S-L-(Dox) (SEQ ID NO: 329); 

Ac-R-G(tBu)-P-A-R-S-L-(Dox) (SEQ ID NO: 330); 

Ac-R-L-P-A-R-S-L-(Dox) (SEQ ID NO: 331); 

Ac-R-V-P-A-R-S-L-(Dox) (SEQ ID NO: 332); 
10 Ac-R-nL-P-A-R-S-L-(Dox) (SEQ ID NO: 333); 

Ac-l-V-S-G-R-A-L-(Dox) (SEQ ID NO: 334); 

Ac-l-V-S-G-R-S-S-L-(Dox) (SEQ ID NO: 335); 

Ac-l-V-S-G-R-A-S-L-(Dox) (SEQ ID NO: 336); 

Ac-l-V-S-A-R-M-A-(Dox) (SEQ ID NO: 337); 
15 Ac-l-V-S-A-R-nL-A-(Dox) (SEQ ID NO: 338); 

Ac-l-V-S-A-R-S-L-(Dox) (SEQ ID NO: 339); 

Ac-l-V-S-A-R-S-nL-(Dox) (SEQ ID NO: 340); 

Ac-l-V-S-A-R-S-S-L-(Dox) (SEQ ID NO: 341); 

Ac-G-S-G-R-S-A-(Dox) (SEQ ID NO: 585); 
20 Ac-G-S-G-R-S-L-(Dox) (SEQ ID NO: 277); 

Ac-G-S-G-R-A-L-(Dox) (SEQ ID NO: 278); 

Ac-G-S-G-R-S-S-L-(Dox) (SEQ ID NO: 279); 

Ac-G-S-G-R-L-(Dox) (SEQ ID NO: 280); 

Ac-G-S-G-(4-guan)Phg-S-L-NH2 (SEQ ID NO: 281); 
25 Ac-G-S-G-R-S-S-Cha-(Dox) (SEQ ID NO: 282); 

Ac-G-S-G-R-A-S-L-(Dox) (SEQ ID NO: 283); 

Ac-G-S-G-R-S-nL-(Dox) (SEQ ID NO: 284); 

Ac-G-T-G-R-S-nL-(Dox) (SEQ ID NO: 285); 

Succ-bA-T-G-R-S-nL-(Dox) (SEQ ID NO: 286); 
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Ac-G-T-G-R-S-hCha-(Dox) (SEQ ID NO: 287); 
Ac-G-hS-G-R-S-nL-(Dox) (SEQ ID NO: 288); 
Ac-G-dS-A-R-S-A-{Dox) (SEQ ID NO: 289); 
Ac-G-S-A-R-S-L-(Dox) (SEQ ID NO: 290); 
5 Ac-G-S-A-R-S-S-Cha-(Dox) (SEQ ID NO: 291); 
Ac-G-S-A-R-S-S-L-(Dox) (SEQ ID NO: 292); 
. Ac-G-S-A-R-A-S-L-(Dox) (SEQ ID NO: 293); 
Ac-V-S-G-R-S-L-(Dox) (SEQ ID NO: 294); 
Ac-V-S-G-R-A-L-(Dox) (SEQ ID NO: 295); 
10 Ac-V-S-G-R-A-S-L-(Dox) (SEQ ID NO: 296); 
Ac-V-S-G-R-S-S-L-(Dox) (SEQ ID NO: 297); 
Ac-V-S-A-R-M-A-(pox) (SEQ ID NO: 298); 
Ac-V-S-A-R-nL-A-(Dox) (SEQ ID NO: 299); 
Ac-V-S-A-R-S-nL-(Dox) (SEQ JD NO: 300); 
15 Ac-V-S-A-R-S-L-(Dox) (SEQ ID NO: 301); 

Ac-(Me)C-P-G-R-V-V-(Dox) (SEQ ID NO: 302); 
Ac-(Me)C-P-G-R-V-V-(Dox) (SEQ ID NO: 303); 
Ac-C(Me)-P-G-R-A-L-(Dox) (SEQ ID NO: 304); 
Ac-C(Me)-P-G-R-S-L-(Dox) (SEQ ID NO: 305); 
20 Ac-C(Me)-P-A-R-S-L-(Dox) (SEQ ID NO: 306); 
Ac-C(Me)-P-A-R-A-S-L-(Dox) (SEQ ID NO: 307); 
Ac-G(tBu)-P-G-R-S-L-(Dox) (SEQ ID NO: 308); 
Ac-Q-S-R-A-A-(taxol) (SEQ ID NO: 552); 
Ac-Q-S-R-S-A-(taxol) (SEQ ID NO: 553); 
25 Ac-Q-S-R-S-G-(taxoI) (SEQ ID NO: 554); 
Ac-R-S-R-A-A-(taxol) (SEQ ID NO: 555); 
Ac-R-Q-S-R-A-A-(taxol) (SEQ ID NO: 556); 
Ac-R-Q-S-R-S-A-(taxol) (SEQ ID NO: 557); 
Ac-R-Q-S-R-S-A-A-(taxol) (SEQ ID NO: 558); 
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Ac-R-G-S-G-R-S-A-(taxol) (SEQ ID NO: 559); 

Ac-S-G-R-A-A-(taxol) (SEQ ID NO: 560); 

Ac-S-G-R-S-A-(taxol) (SEQ ID NO: 561); 

Ac-S-G-R-S-S-A-(taxol) (SEQ ID NO: 562); 
5 Ac-S-G-R-A-S-A-(taxol) (SEQ ID NO: 563); 

Ac-S-G-R-S-G-(taxol) (SEQ ID NO: 564); 

Ac-S-G-R-S-S-G-(taxoI) (SEQ ID NO: 565); 

Ac-S-G-R-S-G-A-(taxol) (SEQ ID NO: 566); 

Ac-S-G-R-S-G-G-(taxol) (SEQ ID NO:567); 
10 Ac-G-T-G-R-S-G-G-(taxol) (SEQ ID NO: 568); 

Ac-L-R-R-Q-S-R-A-A-(Dox) (SEQ ID NO: 597); 

MeS02-dA(Chx)-Abu-R-S-L-(Dox) (SEQ ID NO: 598); 

Ac-R-A-R-S-L-(Dox) (SEQ ID NO: 599); 

Ac-dA(Chx)-Abu-R-S-L-(Dox) (SEQ ID NO: 600); 
15 Ac-dA(Chx)-Abu-R-S-S-L-{Dox) (SEQ ID NO: 601); 

Ac-Q-G-R-S-S-L-(Dox) (SEQ ID NO: 602); 

MeOCO-dhF-P(OH)-R-S-S-L-(Dox) (SEQ ID NO: 603); 

MeOCO-Quat4-G-R-S-L-(Dox) (SEQ ID NO: 604); 

As-dCha-P(OH)-R-S-S-L-(Dox) (SEQ ID NO: 605); 
20 Ac-dCha-Abu-R-S-S-A-(taxol) (SEQ ID NO: 606); 

MeOCO-Quat2-G-R-S-L-NH2 (SEQ ID NO: 607); 

MeOCO-Quat3-G-R-S-L-NH2 (SEQ ID NO: 608); and 

MeOCO-Quat-G-R-S-L-NH2 (SEQ ID NO: 609). 

EXAMPLE 10 

25 Pharmacokinetic studies of conjugates and fraction of the dose 

metabolized to Doxorubicin and Leucine-doxorubicin in naive and tumor 
bearing mice. 

Naive or tumor bearing nude mice 8-12 weeks of age have been 
used for pharmacokinetic studies of the test conjugates. Tumor cells for 
30 implantation have been prepared following one of three protocols. 
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Protocol A Tumor cells collected from tissue culture 

Tumor cells are trypsinized and resuspended in the growth medium 
and centrifuged for 6 min at 200xg. The cells are resuspended in serum- 
free medium and counted. The appropriate volume of the solution 
5 containing the desired number of cells is then transferred to a conical 
centrifuge tube, centrifuged as before and resuspended in the appropriate 
volume of a cold 1 :1 mixture of cells in phenol free medium : matrigel. 
Each mouse is inoculated with 0.2 - 0.5 mL containing between 1x10® 
and 1x10' tumor cells subcutaneously or orthotopically. 

10 Protocol B Tumor cell suspension 

Established tumors (200-1 OOOmm^) are dissected from mice, 
weighed and rinsed in tumor cell growth medium. The tumors are passed 
through a steel cell dissociation sieve. The cells are rinsed through the 
sieve with growth medium. The cells are centrifuged for 6 min at 200xg 

15 and resuspended in the appropriate volume of a cold 1:1 mixture of cells : 
matrigel. Each mouse is inoculated with 0.2-0.5 mL of tumor cells 
subcutaneously or orthotopically. 
Protocol C Tumor fragments 

Alternatively a tumor measuring approximately SOOmm^ is 

20 dissected out of a mouse, rinsed In tumor cell growth medium and cut 
into 1-2 mm^ fragments. Each fragment is inoculated subcutaneously or 
orthotopically using a trocar needle. 
Pharmacokinetic Study 

NaTve or tumor bearing mice are individually weighed and assigned 

25 to groups. The mice are dosed with 1-100umole/kg, including 
30umole/kg, 25umole/kg, or 21.5umole/kg of the test conjugate 
intraperrtoneally or intravenously. At a given time point between 5 
minutes and 24 hours after administration of the compound the mice are 
sacrificed. Blood is collected in a syringe containing protease inhibitors 



wo 02/095007 



PCT/US02/16819 



-223- 

such as EDTA, AEBSF, Aprotinin, Leupeptin, Bestatin, Pepstanin A or E64 
and transferred into a heparinized blood collection tube. The plasma is 
prepared by centrifugation. The tumors are collected and pulverized in 
liquid nitrogen. The resulting tumor powders are stored at -80°C. The 
5 tumor powders and plasma are extracted and analyzed for the parent test 
conjugate and its products including Leucine-doxorubicin {or norleucine- 
doxorubicin, etc.) and doxorubicin. 

Looking at the delivery of the toxin to the tumor cells, and also 
looking at the parent conjugate and the levels of toxin (dox and nor-leu 
10 dox) in the plasma. 

RESULTS 

For example, test conjugate (21 .5 umoie/kg of Ac-Gly-Ser-Gly-Arg- 
Ser-nLeu-Dox {see Example 2)) was administered to naTve and tumor 
bearing (TB) mice intraperitoneally (IP) or intravenously {IV). One hour 

15 after administration plasma and tumor tissue was collected from the 
mice. Concentrations of the test conjugate and its products are 
compared. The results show that the conjugate does not get into the 
tumor, the toxins (norleu dox and dox - //M concentrations in the tumor at 
one hour following the single {both IP and IV) injection. There were lower 

20 levels of dox and nor-leu dox the plasma than in the tumor. 
Extraction, chromatography LC/MS conditions 
Plasma: Plasma samples are prepared using acetonitrile protein 
precipitation. A standard curve was constructed from addition of 5 to 20 
jjL volumes of a standard compound to 0.1 mL or 0.05 mL volumes of 

25 plasma on ice. The standard curve ranges from 10 ng/mL - 1 ug/mL or 
from 100 ng/mL - 4 ug/mL of the standard compound. Immediumtely 
after standard addition, acetonitrile is added to precipitate the proteins. 
The study plasma samples were prepared by thawing the frozen plasma 
samples on ice. The aliquots were added directly to the acetonitrile. 
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After sample precipitation, the sample is mixed using vortex mixing. The 
precipitate was pelleted using centrifugation. The supernatent was dried 
using vacuum centrifugation. The sample was reconstituted with 0.15 
mL of 30% acetonitrile - 70% (0,01 M ammonium acetate with 0.1% 
5 formic acid). 0.01 mL of the sample was injected for LC-MS analysis. 
The HPLC conditions were a linear gradient of 20% acetonitrile - 80% 
(10 mM ammonium acetate - .1% formic acid) to 50% acetonitrile - 50% 
(10 mM ammonium acetate - .1% formic acid) in 1 minute at 0.3 mL/min 
in a 30 X 2.1 mm Zorbax SB CIS HPLC column. Detection was provided 

10 by a triple quad mass spectrometer with electrospray ionization. 

Doxorubicin was monitored using the m/z transition 544.1 - 396.8. 
Leucine-doxorubicin was monitored using 657.2 - 242.8. An exemplary 
parent conjugate was monitored using 1555.9 -1555.9. Scanning LC- 
MS and fluorescence detection was used to identify cleavage products 

15 other than doxorubicin or leucine-doxorubicin (or norleucine-doxorubicin, 
etc.) in the plasma. 

Tumor: Immediately after excision from the mouse, the tumor for 
analysis is weighed and placed into a mortar containing liquid nitrogen. 
With the mortar nested in a bed of dry ice, the tumor is ground into a 

20 fine powder while additional liquid nitrogen is added as needed to avoid 
thawing. When a homogeneous tumor powder is achieved, the remaining 
liquid nitrogen is allowed to boil off. The tumor powder is quantitatively 
transferred to a 1 5ml conical tube that has been pre-chilled and is on dry 
ice. The sample is stored at -70 °C until analysis. The tumor powder is 

25 thawed on ice and vortex mixed with 0.01 M ammonium acetate in a 1 
gram tumor/mL ammonium acetate solution concentration to form a 
slurry. An aliquot of 0.1 mL of the tumor slurry is precipitated with 0.5 
mL acetonitrile. The supernatant is separated from the precipitated solids 
and then evaporated using vacuum centrifugation. Quantification of 
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doxorubicin, leucine-doxorubicin (or norleucine-docorubicin, etc.), is 
achieved by reference to a standard curve constructed from spiking 
measured amounts of standard compounds {doxorubicin, leucine- 
doxorubicin, etc.) into control tumor slurry. A typical standard curve 
5 ranges from 1 ng to 200 ng of compound per aliquot of tumor slurry. 
After the unknown samples and standards are processed and dried, the 
residue is reconstituted in 0.1 5mL of 30% acetonitrile - 70% (0.01 M 
ammonium acetate + 0.1% formic acid). lO/yL of solution is injected 
onto a liquid chromatography - mass spectrometry system. The HPLC 

10 conditions were a linear gradient of 20% acetonitrile - 80% (10 mM 

ammonium acetate - .1% formic acid) to 50% acetonitrile - 50% (10 mM 
ammonium acetate - .1% formic acid) in 1 minute at 0.3 mL/min in a 30 
X 2.1 mm Zorbax SB CIS HPLC column. Detection was provided by a 
triple quad mass spectrometer with electrospray ionization. Doxorubicin 

15 was monitored using the m/z transition 544.1 - 396.8. Leucine- 
doxorubicin was monitored using 657.2 - 242.8. 

Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended 
20 claims. 
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WHAT IS CLAIMED IS: 

1 . A conjugate, comprising a therapeutic agent and a peptidic 
substrate linked thereto optlonaily via a linker, wherein the peptidic 
substrate is proteolytically cleaved by a cell surface protease or a soluble, 

5 released or shed form thereof, to liberate the therapeutic agent, wherein 
the conjugate is not substantially cleaved by ptasmin or prostate specific 
antigen (PSA). 

2. The conjugate of claim 1, wherein the liberated therapeutic 
agent is active. 

10 3. The conjugate of claim 1, wherein cleavage liberates the 

therapeutic agent in a form that requires further processing for activation. 

4- The conjugate of claim 1 that comprises the components: 
(peptidic substrate)^, (Linker)^, and (therapeutic agent)t; 

wherein at least one peptidic substrate moiety is linked with or 
15 without a linker to at least one therapeutic agent, s is 1 to 6, q is 0 to t, 
and t is 1 to 6, wherein a cell surface protease that cleaves the peptidic 
substrate(s) results in delivery of the therapeutic agent to the cell. 

5. The conjugate of claim 1, wherein the peptidic substrate 
comprises one amino acid or more, wherein, upon proteolytic cleavage of 

20 the conjugate, the resulting therapeutic agent is active or in a form that, 
upon further processing, is active. 

6. The conjugate of claim 1, wherein the cell surface protease 
is a serine protease. 

7. The conjugate of claim 1, wherein the cell surface protease 
25 is a type II transmembrane serine protease (MTSP) or an endotheliase. 

8. The conjugate of claim 1, wherein the cell surface protease 
is selected from endotheliase 1 , endotheliase 2, MTSP1 , MTSP3, MTSP4, 
MTSP6, MTSP7, MTSP9, MTSPIO, MTSP12, MTSP20, MTSP22, 
MTSP25, corin, enterokinase, human airway trypsin-like protease (HAT), 
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TI\/iPRSS2, hepsin, urokinase-type plasminogen activator (uPA), and 
TMPRSS4. 

9. The conjugate of claim 1, wherein the cell surface protease 
comprises a polypeptide selected from the group consisting of 

5 a polypeptide comprising the sequence of amino acids set 

forth in any of SEQ ID Nos. 2, 4, 6. 8, 1 0, 1 2, 1 4, 1 6, 1 8, 20, 22, 24, 
26, 29, 31, 33, 35, 37, 39, 41, 43, 45, 270, 272, 274 and 276; 

a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of high stringency to the sequence of 
10 nucleotides set forth in any of SEQ ID Nos 1 , 3, 5, 7. 9, 1 1 , 1 3, 1 5, 1 7, 
19, 21, 23, 25, 28, 30, 32, 34, 36, 38, 40, 42, 44, 269, 273 and 275; 

a polypeptide that comprises a sequence of amino acids 
having at least about 40% sequence identity with the sequence of amino 
acids set forth in SEQ ID Nos. 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 
15 24, 26, 29, 31, 33, 35, 37, 39, 41, 43, 45, 270, 272, 274 and 276; 
and 

a polypeptide encoded by a splice variant of the sequence of 
nucleotides set forth in any of SEQ ID Nos. 2, 4, 6, 8, 10, 12, 14, 16, 
18, 20, 22, 24, 26, 29, 31, 33, 35, 37, 39, 41, 43, 45, 270, 272, 274 
20 and 276. 

10. The conjugate of claim 1, wherein the therapeutic agent is a 
toxin, a small organic molecule, a nucleic acid, protein therapeutic agents, 
a cytokine or a growth factor. 

1 1 . The conjugate of claim 1, wherein the therapeutic agent is 
25 an anti-cancer agent. 

12. The conjugate of claim 1, wherein the therapeutic agent is 
an anti-angiogenic agent. 

1 3. The conjugate of claim 1 , wherein the therapeutic agent is 
selected from abrin, ricin A, pseudomonas exotoxin Shiga toxin, 
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diphtheria toxin, a tumor necrosis factor, a-interferon, K-'nterferon, nerve 
growth factor, tissue factor and tissue factor variants, FAS-ligand 
platelet derived growth factor, tissue plasminogen activator, interleukin-1 
(IL-1), interleukin-2 {iL-2), interleukin-6 (IL-6), granulocyte macrophage 
5 colony stimulating factor (GMCSF), granulocyte colony stimulating factor 
(G-CSF), erythropoietin (EPO), nerve growth factor, fibroblast growth 
factors (FGFs), and epidermal growth factor. 

14. The conjugate of claim 1, wherein the therapeutic agent is 
selected from alkylating agents, toxins, antiproliferative agents, pro- 

10 apoptotic agents, pro-coagulants, cytotoxic nucleosides and tubulin 
binding agents. 

15. The conjugate of claim 1 , wherein the therapeutic agent is 
selected from among the following classes of drugs: 



20 



15 



a) anthracycline family of drugs, 

b) vinca alkaloid drugs, 

c) mitomycins, 

d) bleomycins, 

e) cytotoxic nucleosides, 

f) pteridine family of drugs. 

g) diynenes. 



h) estramustine. 



25 



i) cyclophosphamide, 

j) taxanes, 

k) podophyllotoxins, 

I) maytansanoids, 

m) epothilones, and 

n) combretastatin and analogs. 



or pharmaceutically acceptable derivatives thereof. 
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16. The conjugate of claim 1 , wherein the therapeutic agent is 
selected from among the following drugs: 

a) doxorubicin, 

b) carminomycin, 
5 c) daunorubicin, 

d) aminopterin, 

e) methotrexate, 

f) methopterin, 

g) dichloro methotrexate, 
10 h) mitomycin C, 

i) porfiromycin, 

j) 5-fluorouracil, 

k) 6-mercaptopurine, 

i) cytbsine arabinoside, 
15 m) podophyllotoxin, 

n) etoposide, 

o) etoposide phosphate, 

p) melphalan, 

q) vinblastine, 
20 r) vincristine, 

s) ieurosidine, 

t) vindesine, 

u) estramustine, 

v) cisplatin, 
25 w) cyclophosphamide, 

x) taxol, 

y) leurositte, 

z) 4-desacetvivinbiastine, 
aa) epothilone B, 



wo 02/095007 PCT/US02/16819 



-230- 

bb) taxotere, 
cc) maytansanol, 
dd) epothilone A, and 
ee) combretastatin and analogs; 
5 or a pharnnaceutically acceptable derivative thereof. 

17. The conjugate of claim 1, further comprising a linker 
between the therapeutic agent and the peptidic substrate. 

1 8. The conjugate of claim 1 7, wherein the linker comprises a 
carbohydrate, peptide, and/or hydrocarbon core. 

10 19. The conjugate of claim 17, wherein the linker comprises: 

a biscarbonyl alkyl diradical whereby an amine moiety on the 
therapeutic agent is connected with the linker unit to form an amide bond 
and the amino terminus of the peptidic substrateJs connected with the 
other end of the linker unit also forming an amide bond; or 

15 a diaminoaikyi diradical linker unit, whereby a carbonyl moiety on 

the therapeutic agent is covalently attached to one of the amines of the 
linker unit while the other amine of the linker unit is covalently attached 
to the C-terminus of the peptidic substrate; or 

is a self-eliminating linker of the following formulae: 

20 



25 



30 
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where A is NH or O; D is N(H or alkyi) or O; R^^ is H, alkyi, cycioalkyi, 
cycloalkylalkyi, aryl, heteroaryl optionally substituted with 1 or more, 
such as, for example, 1 to 3, substituents selected from halo, halo alkyl 
and alkyl, aralkyi, heteroaralkyl, alkenyl containing 1 to 2 double bonds, 
5 alkynyl containing 1 to 2 triple bonds, alk(en)(yn)yl groups, halo, 

pseudohalo, cyano, hydroxy, haloalkyi and polyhaloalkyi, such as, for 
example, halo lower alkyl, especially trifluoromethyl, formyl, 
alkylcarbonyl, arylcarbonyl that optionally is substituted with 1 or more, 
such as, for example, 1 to 3, substituents, for example, selected from 
10 halo, halo alkyl and alkyl, heteroarylcarbonyl, carboxy, alkoxycarbonyl, 
aryloxycarbonyl, aminoimino, alkoxycarbonylamino, 

aryloxycarbonylamino, aminocarbonyl, alkylaminocarbonyl, dialkylamino- 
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carbonyl, arylaminocarbonyl, diarylaminocarbonyl, aralkylaminocarbonyl, 
alkoxy, aryloxy, perfluoroalkoxy, alkenyloxy, alkynyloxy, arylalkoxy, 
aminoalkyi, alkylaminoalkyt, dialkyiaminoalky!, arylaminoalkyl, amino, 
alkylamino, dialkylamino, arylamino, alkylarylamino, alkylcarbonylamino, 
5 arylcarbonylamino, azido, nitro, mercapto, alkylthio, arylthio, 

perfluoroalkylthio, thiocyano, isothiocyano, alkytsulfinyl, alkylsulfonyl, 
arylsulfinyL arytsulfonyi, aminosulfonyl, alkylaminosulfonyl, 
dialkylaminosulfonyl and arylaminosulfonyl; and y is an integer fronn 1 to 
3. 

10 20. The conjugate of claim 17, wherein the linker is a diannine 

comprising a cyclic alkylene moiety. 

21 . The conjugate of claim 17, wherein the diamine contains a 
bicycloalkylene moiety. 

22. The conjugate of claim 1 7, wherein the linker selected from 
15 1 ,4-bis(amlnomethyl)cyclohexane, 1 ,4-bis(aminomethyl)cycloheptane, 

1 .3- bis{aminomethyl)cyclopentane, 1 -amino-4-(aminomethyl)cyclohexane, 

1 .4- diaminocyclohexane and 1 ,4-bis(aminomethyi)bicyclo[2.2.2]octane. 

23. The conjugate of claim 17, wherein the linker is a IrOJ- 
diaminoalkane. 

20 24. The conjugate of claim 1 7, wherein the linker is a 

1 ,3-diaminopropane. 

25. The conjugate of claim 17, wherein the linker is a 1,0*- 
dicarbonylalkane. 

26. The conjugate of claim 25, wherein the linker selected from 
25 oxalic, malonic, succinic, glutaric, adipic and pivalic acids. 

27. The conjugate of claim 1, wherein the peptidic substrate 
comprises PI that is any amino acid. 

28. The conjugate of claim 27, wherein PI is a naturally-occurring 

amino acid. 
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29. The conjugate of claim 27, wherein PI is an amino acid with 
an aromatic, branched, or branched aromatic side chain. 

30. The conjugate of claim 1, wherein the peptidic substrate 
comprises PI, where P1 is selected from among Arg, Lys, Tyr, Phe, Trp, 

5 Ala, Val, lie and Thr. 

31. The conjugate of claim 1, wherein: 

the peptidic substrate comprises a P1-PV bond; 

the PI-PV bond is the site of cleavage by a cell surface protease; 

PI is selected from Arg, Lys, Tyr, Phe, Trp, Ala, Val, Me and Thr; 

10 and 

PV is Gly, Ser, Ala, Leu, lie, d-lle, nLeu, Val, nVal, Aib, Abu, Met 
or 6-aminohexanoyl. 

32. The conjugate of claim 1, wherein the peptidic substrate 
comprises PI, wherein PI is Arg, Lys or an Arg surrogate. 

15 33. The conjugate of claim 1, further comprising a P2 residue 

selected from Phe, Ser, Gly and Ala. 

34. The conjugate of claim 1 , further comprising a P3 residue 
selected from Arg, Lys, Gin, Ser, Quat and Arg surrogates. 

35. The conjugate of claim 1, further comprising a P4 residue 
20 selected from Pro, Arg, Ser, Ala, Lys, Gly, nLeu, Leu, Tyr, Giu, Phe and 

Val. 

36. The conjugate of claim 1, further comprising a P5 residue 
selected from Arg and Arg surrogates. 

37. The conjugate of claim 1, further comprising a P6 residue 
25 selected from Leu, He and Val. 

38. The conjugate of claim 1, further comprising a P2' residue 
selected from Gly, Ser, Ala, Leu, lie, d-lle, nLeu, Val, nVal, Aib, Abu, Met 
and 6-aminohexanoyl. 
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39. The conjugate of claim 1 , further comprising a P3' residue 
selected from Gly, Ser, Ala, Leu, He, nLeu, Val, nVal, Aib, Abu, Met and 
6-aminohexanoyl. 

40. The conjugate of claim 1 , wherein: 

5 the peptidic substrate comprises a 5-mer that has the formula: 

P4-P3-P2-Pl-Pr, wherein: 

PI is selected from among Arg, Lys, Tyr, Phe, Trp, Ala, Val, 

lie and Thr 

P2 is selected from Phe, Ser, Gly and Ala; 
10 P3 is selected from Arg, Lys, Gin, Quat and Arg surrogates; 

P4 is selected from Pro, Arg, Ser, Ala, Lys, Gly, nLeu, Leu, 
Tyr, Glu, Phe and Val; and 

PV is Gly, Ser, Ala, Leu, lie, d-lle, nLeu, Val, nVal, Alb, Abu, 
Met or 6-aminohexanoyl. 
15 41 . The conjugate of claim 40, wherein: 

the peptidic substrate optionally further comprises one or more of a 
P5 or P2' amino acid residue, wherein: 

P5 is Arg or an Arg surrogate; and 

P2' is selected from among Gly, Ser, Ala, Leu, Me, d-lle, nLeu, Val, 
20 nVal, Aib, Abu, Met and 6-aminohexanoyl. 

42. The conjugate of claim 41 , wherein: 

if the peptidic substrate comprises a P5 amino acid residue, then 
the peptidic substrate optionally further comprises a P6 amino acid 
residue selected from Leu, tie and Val; and 
25 if the peptidic substrate comprises a P2' amino acid residue, then 

the peptidic substrate optionally further comprises a P3' amino acid 
residue selected from Gly, Ser, Ala, Leu, lie, nLeu, Val, nVal, Aib, Abu, 
Met and 6-aminohexanoyl. 

43. The conjugate of claim 1 , wherein: 



wo 02/095007 



PCT/US02/16819 



-235- 



the therapeutic agent is conjugated directly or via a linker to the C 
terminus of the peptidic substrate. 

44. The conjugate of claim 1, wherein: 
the peptidic substrate comprises a cap at the N-terminus. 
5 45. The conjugate of claim 1, wherein the cap is a hydrophilic 

blocking group. 

46. The conjugate of claim 1, wherein the cap is an acyl, 
sulfonyl or carbamoyi derivative. 

47. The conjugate of claim 45, wherein the blocking group is 

10 selected from among hydroxylated alkanoyls, polyhydroxylated alkanoyls, 
polyethylene glycols, glycosylates, sugars and crown ethers. 

48. The conjugate of claim 43 that has formula I: 
X"-(P6)^-{P5)p-{P4)r(P3)r(P2)rP1-{PV)^-(P2')HP3'),-(L),-Z 
or a derivative thereof, wherein: 

15 Z is a therapeutic agent; 

L is a linker; 

I, j, i, p and m are selected as follows: 

I is O or 1 ; when I is 0, j, i, p and m are 0; when Ms 1 , j is 0 or 1 ; 
when J Is 0, i, p and m are 0; when j is 1 , i is 0 or 1 ; when i is 0, p and m 
20 are 0; when i is 1 , p is 0 or 1 ; when p is 0, m is 0; when p is 1, m is 0 or 

1; 

u, k and r are selected as follows: 

u is 0 or 1 ; when u is 0, k and r are 0; when u is 1 , k is 0 or 1 ; 
when k is 0, r is 0; when k is 1 , r is 0 or 1 ; 
25 n is 0 or 1 ; 

X" is hydrogen, or an acyl, sulfonyl or carbamoyl cap; 

PI is selected from among Arg, Lys, Tyr, Phe, Trp, Ala, Val, lie and 

Thr; 
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PV is Gly, Ser, Ala, Leu, He, d-Ile, nLeu, Val, nVal, Aib, Abu, Met 
or 6-aminohexanoyl; 

P2 is selected from Phe, Ser, Gly and Ala; 

P3 is selected from Arg, Lys, Gin, Ser, Quat and Arg surrogates; 
5 P4 is selected from Pro, Arg, Ser, Ala, Lys, Gly, nLeu, Leu, Tyr, 

Glu, Phe and Val; 

P5 is selected from Arg and Arg surrogates; 
P6 is selected from Leu, He and Val; 

P2' is selected from Gly, Ser, Ala, Leu, He, d-lle, nLeu, Val, nVal, 
10 Aib, Abu, Met and G-aminohexanoyl; and 

P3' is selected from Gly, Ser, Ala, Leu, He, nLeu, Val, nVal, Aib, 
Abu, Met and 6-aminohexanoyl. 

49, The conjugate of claim 48, wherein PI is Arg, Lys or an Arg 
surrogate. 

15 50. The conjugate of claim 1, wherein: 

the therapeutic agent is conjugated directly or via a linker to the N 
terminus of the peptidic substrate. 

51 . The conjugate of claim 50, wherein: 

the C-terminus of the peptidic substrate is a carboxylic acid or a 
20 carboxamide derivative. 

52. The conjugate of claim 50 that has formula II: 
Z-(L)„-(P6)„-(P5)p-(P4)r(P3),-{P2),-P1-(Pr),-(P2'),-{P3')rX*= 
or a derivative thereof, wherein: 

Z is a therapeutic agent; 
25 L is a linker; 

1, j, i, p and m are selected as follows: 

I is 0 or 1 ; when I is O, j, i, p and m are 0; when I is 1 , j is O or 1 ; 
when j is 0, i, p and m are 0; when j is 1 , i is 0 or 1 ; when i is 0, p and m 
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are 0; when i is 1 , p is 0 or 1 ; when p is 0, m is 0; when p is 1, m is 0 or 
1; 

u, k and r are selected as follows: 

u is 0 or 1 ; when u is 0, k and r are O; when u is 1 , k is 0 or 1 ; 
5 when k is 0, r is 0; when k is 1 , r is 0 or 1 ; 
n is 0 or 1 ; 

X*", together with the carbonyl group of the amino acid residue to 
which it is attached, forms a carboxylic acid or a carboxamide group; 

PI is selected from among Arg, Lys, Tyr, Phe, Trp, Ala, Val, ile and 

10 Thr; 

Pr is Gly, Ser, Ala, Leu, Ile, d-lle, nLeu, Val, nVal, Aib, Abu, Met 
or 6-aminohexanoyl; 

P2 is selected from Phe, Ser, Gly and Ala; 

P3 is selected from Arg, Lys, Gin, Ser, Quat and Arg surrogates; 
15 P4 is selected from Pro, Arg, Ser, Ala, Lys, Gly, nLeu, Leu, Tyr, 

Glu, Phe and Val; 

P5 is selected from Arg and Arg surrogates; 
P6 is selected from Leu, Ile and Val; 

P2' is selected from Gly, Ser, Ala, Leu, lie, d-lle, nLeu, Val, nVal, 
20 Aib, Abu, Met and 6-aminohexanoyl; and 

P3' is selected from Gly, Ser, Ala, Leu, lie, nLeu, Val, nVal, Aib, 
Abu, Met and 6-aminohexanoyl. 

53. The conjugate of claim 52, wherein PI is Arg, Lys or an Arg 
surrogate. 

25 54. The conjugate of claim 1 , wherein a first therapeutic agent is 

attached, optionally via a first linker, to the N-terminus of the peptidic 
substrate; and 

a second therapeutic agent, which are the same or different from 
the first therapeutic agent, is attached, optionally via a second linker. 
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which are the same or different from the first linker, to the C-terminus of 
the peptidic substrate. 

55. The conjugate of claim 54 that has formula III: 
Z^-(U)„-(P6)^-(P5)p-(P4-),-(P3V(P2)rP1-(PV),-(P2'),-(P3'),-(L2),-22 

5 or a derivative thereof, wherein: 

2^ and 7} are each therapeutic agents and are the same or 
different; 

and are each linkers and are the same or different; 
I, j, i, p and m are selected as follows: 
10 I is 0 or 1 ; when I is 0, \, \, p and m are 0; when I is 1 , j is 0 or 1 ; 

when j is 0, i, p and m are 0; when j is 1 , i is 0 or 1 ; when i is 0, p and m 
are 0; when i is 1 , p is 0 or 1 ; when p is 0, m is 0; when p is 1, m is 0 or 

1; 

u, k and r are selected as follows: 
15 u is 0 or 1 ; when u is 0, k and r are 0; when u is 1 , k is 0 or 1 ; 

when k is 0, r is 0; when k is 1 , r is 0 or 1 ; 
n and v are each independently 0 or 1; 

PI is selected from among Arg, Lys, Tyr, Phe, Trp, Ala, Val, He and 

Thr; 

20 PV is Gly, Ser, Ala, Leu, lie, d-lle, nLeu, Val, nVal, Aib, Abu, Met 

or 6-aminohexanoyl; 

P2 is selected from Phe, Ser, Gly and Ala; 

P3 is selected from Arg, Lys, Gin, Ser, Quat and Arg surrogates; 
P4 is selected from Pro, Arg, Ser, Ala, Lys, Gly, nLeu, Leu, Tyr, 

25 Glu, Phe and Val; 

P5 is selected from Arg and Arg surrogates; 
P6 is selected from Leu, lie and Val; 

P2' is selected from Gly, Ser, Ala, Leu, He, d-lle, nLeu, Vat, nVal, 
Aib, Abu, Met and G-aminohexanoyl; and 
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P3' is selected from Gly, Ser, Ala, Leu, ile, nLeu, Val, nVal, Atb, 
Abu, Met and 6-anninohexanoyl. 

56. The conjugate of clainn 55, wherein P1 is Arg, Lys or an Arg 
surrogate. 

5 57. The conjugate of any of claims 1-57, selected from; 

Ac-Leu-Arg-Ala-Quat-Gly-Arg-Ala-Ala-{therapeutic agent) (SEQ ID NO: 

46) ; 

Ac-Leu-Arg-Ala-Quat-Ala-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 

47) ; 

10 Ac-Leu-Arg-Ser-Quat-Gly-Arg-Ala-Ala-(therapeutic agent) {SEQ ID NO: 

48) ; 

Ac-Leu-Arg-Ser-Quat-Ala-Arg-AIa-Ala-(therapeutic agent) (SEQ ID NO: - 

49) ; 

Ac-Leu-Arg-Pro-Arg-Phe-Lys-Ile-lle-(therapeutic agent) (SEQ ID NO: 50); 
15 Ac-Arg-Pro-Arg-Phe-Lys-lle-lle-(therapeutic agent) (SEQ ID NO: 51); 

Ac-Pro-Arg-Phe-Lys-lle-Ile-{therapeutic agent) (SEQ ID NO: 52); 

Ac-Leu-Arg-Ser-Lys-Ser-Arg-Ala-AIa-(therapeutic agent) (SEQ ID NO: 53); 

Ac-Arg-Ser-Lys-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 54); 

Ac-Ser-Lys-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 55); 
20 Ac-Leu-Arg-Pro-Arg-Phe-Arg-He-lle-(therapeutic agent) (SEQ ID NO: 56); 

Ac-Arg-Pro-Arg-Phe-Arg-lle-lle-{therapeutic agent) (SEQ ID NO:57); 

Ac-Pro-Arg-Phe-Arg-lle-lle-(therapeutic agent) (SEQ ID NO: 58); 

Ac-Leu-Arg-Ser-Arg-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 59); 

Ac-Arg-Ser-Arg-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 60); 
25 Ac-Ser-Arg-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 61); 

Ac-Leu-Arg-Ala-Quat-Gly-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 

62) ; 

Ac-Leu-Arg-Ala-Quat-Ala-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 

63) ; 
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Ac-Leu-Arg-Ser-Quat-Gly-Arg-Ala-Ala-{therapeutic agent) (SEQ ID NO: 

64) ; 

Ac-Leu-Arg-Ser-Quat-Ala-Arg-Ata-Ala-(therapeutic agent) (SEQ ID NO: 

65) ; 

5 Ac-Leu-Arg-Pro-Arg-Phe-Lys-lle-lle-{therapeutic agent) (SEQ ID NO: 66); 

Ac-Arg-Pro-Arg-Phe-Lys-lle-lle-(therapeutic agent) (SEQ ID NO: 67); 

Ac-Pro-Arg-Phe-Lys-lle-lle-(therapeutic agent) (SEQ ID NO: 68); 

Ac-Leu-Arg-Ser-Lys-Ser-Arg-Ala-Ala-(therapeutlc agent) (SEQ ID NO: 69); 

Ac-Arg-Ser-Lys-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 70); 
10 Ac-Ser-Lys-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 71); 

Ac-Leu-Arg-Pro-Arg-Phe-Arg-lle-lle- (therapeutic agent) (SEQ ID NO: 72); 

Ac-Arg-Pro-Arg-Phe-Arg-lle-lle-(therapeutic agent) (SEQ ID NO: 73); 

Ac-Pro-Arg-Phe-Arg-Ile-lle-(therapeutic agent) (SEQ ID NO: 74); 

Ac-Leu-Arg-Ser-Arg-Ser-Arg-AIa-Ala-(therapeutlc agent) (SEQ ID NO: 75); 
15 Ac-Arg-Ser-Arg-Ser-Arg-A!a-Ala-(therapeutic agent) (SEQ ID NO: 76); and 

Ac-Ser-Arg-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 77) 

pyroGlu-Pro-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 78); 

CH3S02-D-HHT-Gly-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 79); 

N-p-tosyl-Gly-Pro-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 80); 
20 Benzoyl-VaI-Gly-Arg-AIa-Ala-(therapeutic agent) (SEQ ID NO: 81); 

CH3S02-D-HHT-Gly-Arg-Ala-Aia-(therapeutic agent) (SEQ ID NO: 82); 

N-a-Z-D-Arg-Gly-Ar9-Ala-Ala-(therapeutic agent) in which Z is 

benzyloxycarbonyl (SEQ ID NO: 83); 

pyroGlu-Gly-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 84); 
25 H-D-lle-Pro-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 85); 

Cbo-L-(K)Glu(a-t-BuO)-Gly-Arg-Ala-Ala-(therapeutrc agent) in which Cbo is 
carbobenzoxy (SEQ ID NO: 86); 

H-D-Pro-Phe-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 87); 
H-D-Val-Leu-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 88); 
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Bz-lle-Glu(K-OH)-Gly-Arg-Ala-Ala-(therapeutic agent) in which Bz is 
benzoyl {SEQ ID NO: 89); 

Bz-lle-Glu(K-OMe)-Gly-Arg-Ala-Ala-(therapeutlc agent) (SEQ ID NO: 90); 

Bz-Pro-Phe-Arg-Ala-Ala-{therapeutic agent) (SEQ ID NO: 91); 
5 H-D-Phe-Pip-Arg-Ala-Ala-(therapeutic agent) (SEQ iD NO: 92); 

H-D-Val-Leu-Lys-Ala-Ala-(therapeutic agent) (SEQ ID NO: 93); 

H-D-Nle-HHT-Lys-Ala-Ala-(therapeutic agent) (SEQ ID NO: 94); 

Pyr-Arg-Thr-Lys-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 95); 

H-Arg-Gln-Arg-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 96); 
10 Boc-Gln-Gly-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 97); 

2-Arg-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 98); 

H-D-HHT-Ala-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 99); 

H-D-CHT-Gly-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 100); 

MeS02-dPhe-Pro-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 101); 
15 J-Z-D-Lys-Pro-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 102); 

CH3S02-D-CHA-But-Arg-Ala-Aia-(therapeutic agent) (SEQ ID NO: 103); 

Ac-Arg-Gin-Ser-Arg-AIa-Ala-(therapeutic agent) (SEQ ID NO: 104); 

Ac-Arg-Arg-Gln-Ser-Arg-Ala-Ala-{therapeutic agent) (SEQ ID NO: 105); 

Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ala-Ala-(therapeutic agent) (SEQ ID NO: 
20 106); 

Ac-Arg-Gln-Ser-Arg-Ala-(therapeutic agent) (SEQ ID NO: 107); 
Ac-Arg-Arg-Gln-Ser-Arg-Ala-(therapeutic agent) (SEQ ID NO: 108); 
Ac-Leu-Arg-Arg-Gin-Ser-Arg-Giy-Gly-(therapeutic agent) (SEQ ID NO: 
109); 

25 Ac-Leu-Arg-Arg-Gln-Ser-Arg-Aia-(therapeutic agent) (SEQ ID NO: 110); 
Ac-Arg-Arg-Gln-Ser-Arg-Ile-(therapeutic agent) (SEQ ID NO: 111); 
Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ala-lle-(therapeutic agent) (SEQ ID NO: 112); 
Ac-Leu-Arg-Ala-Quat-GIy-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
113); 
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Ac-Leu-Arg-Ala-Quat-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
1141; 

Ac-Leu-Arg-Ser-Quat-Gly-Arg-Ser-Leu-{therapeutic agent) (SEQ ID NO: 

115) ; 

5 Ac-Leu-Arg-Ser-Quat-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

116) ; 

Ac-Leu- Arg-Pro-Arg-Phe-Lys-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

117) ; 

Ac-Arg-Pro-Arg-Phe-Lys-Ser-Leu-(therapeutic agent) (SEQ ID NO: 118); 
10 Ac-Pro-Arg-Phe-Lys-Ser-Leu-{therapeutic agent) (SEQ ID NO: 119); 
Ac-Leu-Arg-Ser-Lys-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
120); 

Ac-Arg-Ser-Lys-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 121); 
Ac-Ser-Lys-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 122); 
15 Ac-Leu-Arg-Pro-Arg-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
123); 

Ac-Arg-Pro-Arg-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 124); 
Ac-Pro-Arg-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 125); 
Ac-Leu-Arg-Ser-Arg-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
20 126); 

Ac-Arg-Ser-Arg-Ser-Arg-Ser-Leu-(therapeut!C agent) (SEQ ID NO: 127); 
Ac-Ser-Arg-Ser-Arg-Ser-Leu-(therapeuticagent) (SEQ ID NO: 128); 
Ac-Leu-Arg-AIa-Quat-GIy-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

129) ; 

25 Ac-Leu-Arg-A!a-QLiat-AIa-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

130) ; 

Ac-Leu-Arg-Ser-Quat-GIy-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

131) ; 
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Ac-Leu-Arg-Ser-Quat-Ala-Arg-Ser-Leu-(therapeutic agent) {SEQ ID NO: 

132) ; 

Ac-Leu-Arg-Pro-Arg-Phe-Lys-Ser-Leu-(therapeutic agent) {SEQ ID NO: 

133) ; 

5 Ac-Arg-Pro-Arg-Phe-Lys-Ser-Leu-{therapeutic agent) (SEQ ID NO: 134); 
Ac-Pro-Arg-Phe-Lys-Ser-Leu-(therapeutic agent) (SEQ ID NO: 135); 
Ac-Leu-Arg-Ser-Lys-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
136); 

Ac-Arg-Ser-Lys-Ser-Arg-Ser-Leu-{therapeutic agent) (SEQ ID NO: 137); 
10 Ac-Ser-Lys-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 138); 
Ac-Leu-Arg-Pro-Arg-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
139); 

Ac-Arg-Pro-Arg-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 140); 
Ac-Pro-Arg-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 141); 
15 Ac-Leu-Arg-Ser-Arg-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
142); 

Ac-Arg-Ser-Arg-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 143); 

Ac-Ser-Arg-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 144); 

pyroGlu-Pro-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 145); 
20 CHgSOj-D-HHT-Gly-Arg-Ser-Leu-Ctherapeutrc agent) (SEQ ID NO: 146); 

N-p-tosyl-Gly-Pro-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 147); 

Ben2oyl-Val-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 148); 

CHaSOj-D-HHT-GIy-Arg-Ser-Leu-ltherapeutic agent) (SEQ ID NO: 149); 

N-a-Z-D-Arg-Gly-Arg-Ser-Leu-{therapeutic agent) (SEQ ID NO: 150) (Z = 
25 benzyloxycarbonyl); 

pyroGlu-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 151); 

H-D-lle-Pro-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 152); 

Cbo-L-(K)Glu(ff-t-BuO)-Gly-Arg-Ser-Leu-{therapeutic agent) (SEQ ID NO: 

153) (Cbo = carbobenzoxy); 



wo 02/095007 



PCT/US02/16819 



-244- 

H-D-Pro-Phe-Arg-Ser-Leu-{therapeutic agent) (SEQ ID NO: 154); 
H-D-VaI-Leu-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 155); 
Bz-lle-Glu(K-OH)-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 156) 
{Bz = benzoyl); 

5 Bz-lle-Glu(K-OMe)-Gly-Arg-Ser-Leu-{therapeutic agent) (SEQ ID NO: 157); 

Benzoyl-Pro-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 158); 

H-D-Phe-Pip-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 159); 

H-D-Val-Leu-Lys-Ser-Leu-(therapeutic agent) (SEQ ID NO: 160); 

H-D-Nle-HHT-Lys-Ser-Leu-(therapeutic agent) (SEQ ID NO: 161); 
10 Pyr-Arg-Thr-Lys-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 162); 

H-Arg-Gln-Arg-Arg-Ser-Leu-(therapeutlc agent) (SEQ ID NO: 163); 

Boc-Gin-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 164); 

Z-Arg-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 165); 

H-D-HHT-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 166); 
15 H-D-CHT-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 167); 

MeS02-dPhe-Pro-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 168); 

<J-Z-D-Lys-Pro-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 169); 

CH3S02-D-CHA-But-Arg-Ser-Leu-{therapeutic agent) (SEQ ID NO: 170); 

Ac-Arg-Gln-SBr-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 171); 
20 Ac-Arg-Arg-Gln-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 172); 

Ac-Leu-Arg-Arg-GIn-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

173); 

Ac-Arg-Gln-Ser-Arg-Leu-(therapeutic agent) (SEQ ID NO: 174); 
Ac-Arg-Arg-Gln-Ser-Arg-Leu-(therapeutic agent) (SEQ ID NO: 175); 
25 Ac-Leu-Arg-Arg-GIn-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
176); 

Ac-Leu-Arg-Arg-Gln-Ser-Arg-Leu-(therapeutic agent) (SEQ ID NO: 177); 
Ac-Arg-Arg-Gln-Ser-Arg-Leu-(therapeutic agent) (SEQ ID NO: 178); 
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Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
179); 

Ac-Arg-Gln-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 180); 

Ac-Arg-Gln-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 181); 
5 Ac-Arg-Gln-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 182); 

Ac-Arg-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 183); 

Ac-Arg-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 184); 

Ac-Arg-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 185); 

Ac-Arg-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 186); 
10 Ac-Gln-Ser-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 187); 

Ac-Gln-Gly-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 188); 

Ac-Gln-Ala-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 189); 

Ac-Gln-Phe-Arg-Ser-Leu-(therapeutic agent) (SEQ ID NO: 190). 

Ac-Leu-Arg-Ala-Quat-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
15 191); 

Ac-Leu-Arg-Ala-Quat-Ala-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

192) ; 

Ac-Leu-Arg-Ser-Quat-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

193) ; 

20 Ac-Leu-Arg-Ser-Quat-Ala-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

194) ; 

Ac-Leu-Arg-Pro-Arg-Phe-Lys-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

195) ; 

Ac-Arg-Pro-Arg-Phe-Lys-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
25 196); 

Ac-Pro-Arg-Phe-Lys-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 197); 
Ac-Leu-Arg-Ser-Lys-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
198); 
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Ac-Arg-Ser-Lys-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
199); 

Ac-Ser-Lys-Ser-Arg-Ser-Ser-Leu-(therapeutfc agent) (SEQ ID NO: 200); 
Ac-Leu-Arg-Pro-Arg-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
5 201); 

Ac-Arg-Pro-Arg-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
202); 

Ac-Pro-Arg-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 203); 
Ac-Leu-Arg-Ser-Arg-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
10 204); 

Ac-Arg-Ser-Arg-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
205); 

Ac-Ser-Arg-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 206); 
Ac-Leu-Arg-AIa-Quat-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
15 207); 

Ac-Leu-Arg-Ala-Quat-Ala-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

208) ; 

Ac-Leu-Arg-Ser-Quat-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

209) ; 

20 Ac-Leu-Arg-Ser-Quat-Ala-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

210) ; 

Ac-Leu-Arg-Pro-Arg-Phe-Lys-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

211) ; 

Ac-Arg-Pro-Arg-Phe-Lys-Ser"Ser-Leu-(therapeutic agent) (SEQ ID NO: 
25 212); 

Ac-Pro-Arg-Phe-Lys-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 213); 
Ac-Leu-Arg-Ser-Lys-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
214); 
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Ac-Arg-Ser-Lys-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
215); 

Ac-Ser-Lys-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 216); 
Ac-Leu-Arg-Pro-Arg-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
5 217); 

Ac-Arg-Pro-Arg-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
218); 

Ac-Pro-Arg-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 219); 
Ac-Leu-Arg-Ser-Arg-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
10 220); 

Ac-Arg-Ser-Arg-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
221); 

Ac-Ser-Arg-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 222); 
pyroGlu-Pro-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 223); 
15 CH3S02-D-HHT-Gly-Arg-Ser-Ser-Leu-(therapeutlc agent) (SEQ ID NO: 
224); 

N-p-tosyl-Gly-Pro-Arg-Ser-Ser-Leu- (therapeutic agent) (SEQ ID NO: 225); 
Benzoyl-Val-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 226); 
CH3S02-D-HHT-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 
20 227); 

N-a-Z-D-Arg-Gly-Arg-Ser-Ser-Leu-(therapeutrc agent) (SEQ ID NO: 228) (Z 
= benzyloxycarbonyl); 

pyroGlu-Gly-Arg-Ser-Ser-Leu-(therapeutlc agent) (SEQ ID NO: 229); 
H-D-lle-Pro-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 230); 
25 Cbo-L-(K)Glu{Gr-t-BuO)-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID 
NO: 231) (Cbo - carbobenzoxy); 

H-D-Pro-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ (D NO: 232); 
H-D-Val-Leu-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 233); 
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Bz-ile-Glu{K-OH)-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

234) <Bz = benzoyl); 

Bz-lle-Giu(K-OMe)-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

235) ; 

5 Benzoyl-Pro-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 236); 

H-D-Phe-Pip-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 237); 

H-D-Val-Leu-Lys-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 238); 

H-D-Nle-HHT-Lys-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 239); 

Pyr-Arg-Thr-Lys-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 240); 
10 H-Arg-Gln-Arg-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 241); 

Boc-Gln-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 242); 

Z-Arg-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 243); 

H-D-HHT-Ala-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 244); 

H-D-CHT-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 245); 
15 MeS02-dPhe-Pro-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 246); 

<5-Z-D-Lys-Pro-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 247); 

CH3S02-D-CHA-But-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

248); 

Ac-Arg-Gln-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 249); 
20 Ac-Arg-Arg-Gln-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

250) ; 

Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

251) ; 

Ac-Arg-Gln-Ser-Arg-Leu-(therapeutic agent) (SEQ ID NO: 252); 
25 Ac-Arg-Arg-Gln-Ser-Arg-Leu-(therapeutic agent) (SEQ ID NO; 253); 

Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 

254) ; 

Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ser-Leu-(therapeutlc agent) (SEQ ID NO: 

255) ; 
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Ac-Arg-Arg-Gln-Ser-Arg-Ser-Leu-{therapeutic agent) (SEQ ID NO: 256); 
Ac-Leu-Arg-Arg-Gln-Ser-Arg-Ser-Ser-Leu-{therapeutic agent) (SEQ ID NO: 
257); 

Ac-Arg-Gln-Gly-Arg-Ser-Ser-Leu-<therapeutic agent) (SEQ ID NO: 258); 
5 Ac-Arg-Gln-Ala-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 259); 

Ac-Arg-GIn-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 260); 

Ac-Arg-Ser-Arg-Ser-Ser-Leu-{therapeutic agent) (SEQ ID NO: 261); 

Ac-Arg-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 262); 

Ac-Arg-AIa-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 263); 
10 Ac-Arg-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 264); 

Ac-GIn-Ser-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 265); 

Ac-GIn-Gly-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 266); 

Ac-Gln-Ala-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 267); and 

Ac-Gln-Phe-Arg-Ser-Ser-Leu-(therapeutic agent) (SEQ ID NO: 268). 
15 58. The conjugate of claim 35, wherein P4 is selected fronn Pro, 

Arg, Ser, Ala, Lys, Giy, nLeu, Phe and Val. 

59. The conjugate of claim 35, wherein: 

P2, P3 and/or P4 Is/are selected from Pro, Arg, Ser, Ala, Lys, Gly, 
nLeu, Leu, Tyr, GLu, Phe and VaL 
20 60. The conjugate of claim 35, wherein: 

P2, P3 and/or P4 is/are selected from Pro, Arg, Ser, Ala, Lys, Gly, 
nLeu, Tyr, Glu, Leu Phe and Vat; and 

PI is any amino acid. 

61. The conjugate of claim 60, wherein PI is a naturally- 
25 occurring amino acid. 

62. The conjugate of claim 60, wherein PI is an amimo acid with 
an aromatic, branched, or branched aromatic side chain. 

63. The conjugate of claim 60, wherein PI is selected from 
among Arg, Lys, Tyr, Phe, Trp, Ala, Val, lie and Thr. 
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64. The conjugate of claim 60, wherein PI is Arg, Lys or an Arg 
surrogate. 

65. The conjugate of claim 1, wherein the protease is located at 
the cell surface by virtue of a specific binding interaction with a receptor 

5 therefor. 

66. The conjugate of claim 65, wherein the cell surface protease 
is urokinase plasminogen activator (u-PA) bound to urokinase 
plasminogen activator receptor (u-PAR). 

67. The conjugate of claim 1, that comprises a peptidic 

10 substrate of the formula P6-P5-P4-P3-P2-P1-P1 '-P2'-P3', wherein each of 
PI, P2, P3, P4, P5, P6, Pr and P2' are selected from residues set forth 
in Figures 1 and 2, and P6, P5, P4, P2' and P3' are optional. 

68. The conjugate of claim 67, wherein: 

P6 is optional and is selected from U V, R; 
15 P5 is optional and is selected from R, I, L; 

P4 is optional and is selected from G, C, V; 
P3 is selected from S, dS, P, A or G; 
P2 is selected from A or G; 
PI is R; 

20 PI ' is S, V, M or nL; 

P2' is optional and is selected S, L, A or V; and 
P3' is optional and is L. 

69. A conjugate selected from among those set forth in Figures 
1-5, wherein the therapeutic agent doxorubicin (Dox) or taxol (Tax) 

25 optionally is replaced with any therapeutic agent. 

70. The conjugate of claim 65, wherein the therapeutic agent is 
a toxin, a small organic molecule, a nucleic acid, protein therapeutic 
agents, a cytokine or a growth factor. 
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71 . The conjugate of claim 65, wherein the therapeutic agent is 
an anti-cancer agent. 

72. The conjugate of claim 65, wherein the therapeutic agent is 
an anti-angiogenic agent. 

5 73. The conjugate of claim 65, wherein' the therapeutic agent is 

selected from abrin, ricin A, pseudomonas exotoxin shiga toxin, 
diphtheria toxin, a tumor necrosis factor, a-interferon, K-interferon, nerve 
growth factor, tissue factor and tissue factor variants, FAS-ligand 
platelet derived growth factor, tissue plasminogen activator, interleukin-1 
10 {IL-D, interleukin-2 (IL-2), interleukin-6 {IL-6), granulocyte macrophage 
colony stimulating factor (GMCSF), granulocyte colony stimulating factor 
(G-CSF), erythropoietin (EPO), nerve growth factor, fibroblast growth 
factors (FGFs), and epidermal growth factor. 

74. The conjugate of claim 65, wherein the therapeutic agent is 
1 5 selected from alkylating agents, toxins, antiproliferative agents, pro- 

apoptotic agents, pro-coagulants, cytotoxic nucleosides and tubulin 
binding agents. 

75. The conjugate of claim 65, wherein the therapeutic agent is 
selected from among the following classes of drugs: 

20 a) anthracycline family of drugs, 

b) vinca alkaloid drugs, 

c) mitomycins, 

d) bleomycins, 

e) cytotoxic nucleosides, 
25 f) pteridine family of drugs. 

g) diynenes, 

h) estramustlne, 

i) cyclophosphamide, 
]) taxanes. 
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k) podophyllotoxins, 
I) maytansanoids, 
m) epothiiones, and 
n) combretastatin and analogs, 
5 or pharmaceutically acceptable derivatives thereof. 

76. The conjugate of clainn 65, wherein the therapeutic agent is 
selected from among the following drugs: 

a) doxorubicin, 

b) carminomycin, 
10 c) daunorubicin, 

d) aminopterin, 

e) methotrexate, 

f) methopterin, 

g) dtchloromethotrexate, 
15 h) mitomycin C, 

i) porfiromycin, 

j) 5-fluorouracil, 

k) 6-mercaptopurine, 

I) cytosine arabinostde, 
20 m) podophyllotoxin, 

n) etoposide, 

o) etoposide phosphate, 

p) melphalan, 

q) vinblastine, 
25 r) vincristine, 

s) leurosidine, 

t) vindesine, 

u) estramustine, 

v) cisplatin. 
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w) cyclophosphamide, 



x) taxol. 



5 



y) leurositte, 

z) 4-desacetylvinbIastine, 
aa) epothilone B, 



bb) 



taxotere, 



dd) 



ee) 



cc) 



maytansanol, 
epothilone A, and 
combretastatin and analogs; 



10 or a pharmaceutically acceptable derivative thereof. 

77. The conjugate of ciaim 65, further comprising a linker 
between the therapeutic agent and the peptidic substrate. 

78. The conjugate of claim 65, wherein the linker comprises a 
carbohydrate, peptide, and/or hydrocarbon core. 

15 79. The conjugate of claim 77, wherein the linker comprises: 

a biscarbonyl alkyl diradical whereby an amine moiety on the 
therapeutic agent is connected with the linker unit to form an amide bond 
and the amino terminus of the peptidic substrate is connected with the 
other end of the linker unit also forming an amide bond; or 

20 a diaminoalkyi diradical linker unit, whereby a carbonyl moiety on 

the therapeutic agent is covalently attached to one of the amines of the 
linker unit while the other amine of the linker unit is covalently attached 
to the C-terminus of the peptidic substrate; or 

is a self-eliminating linker of the following formulae: 



25 
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where A is NH or O; D is N(H or alkyi) or O; R^^ is H, alkyl, cycloalkyl, 
cycloaikylalkyl, aryl, heteroaryl optionally substituted with 1 or more, 
such as 1 to 3, substituents selected from, for example, halo, halo alkyI 
and alkyl, aralkyl, heteroaralkyl, alkenyl containing 1 to 2 double bonds, 
5 alkynyl containing 1 to 2 triple bonds, alk(en)(yn)yl groups, halo, 

pseudohalo, cyano, hydroxy, haloalkyi and polyhaloalkyi, such as, for 
example, halo lower alkyl, including trifluoromethyl, formyl, alkylcarbonyl, 
arylcarbonyl that optionally is substituted with 1 or more, such as, for 
example, 1 to 3, substituents selected from, for example, hato, halo alkyl 
10 and alkyl, heteroarylcarbonyl, carboxy, alkoxycarbonyl, aryloxycarbonyl, 
aminormino, alkoxycarbonylamino, aryloxycarbonylamino, aminocarbonyl, 
alkylaminocarbonyl, dialkylaminocarbonyl, arylaminocarbonyl. 
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diarylaminocarbonyl, aralkylaminocarbonyl, alkoxy, aryloxy, 
perfluoroalkoxy, alkenyloxy, alkynyloxy, aryialkoxy, aminoalkyl, alkyl- 
aminoalkyt, dtaikylamrnoaikyl, arylaminoalkyi, amino, alkylamino, dialkyl- 
amino, arylamino, aikylarylamino, alkylcarbonylamino, arylcarbonylamino, 
5 azido, nitro, mercapto, alkylthio, arylthio, perfluoroalkylthio, thiocyano, 
isothiocyano, alkylsulfinyl, alkylsulfonyl, arylsulfinyl, arylsulfonyl, 
aminosulfonyl, alkylaminosulfonyl, dialkylaminosulfonyl and arylamino- 
sulfonyl; and y is an integer from 1 to 3. 

80. The conjugate of claim 77, wherein the linker is a diamine 
10 comprising a cyclic alkylene moiety. 

81 . The conjugate of claim 77, wherein the diamine contains a 
bicycloalkylene moiety. 

82. The conjugate of claim 77, wherein the linker selected from 
1 ,4-bis(aminomethyl)cyclohexane, 1 ,4-bis(aminomethyl)cycloheptane, 

15 1 ,3-bis(amrnomethyl)cyclopentane, 1-amino-4-(aminomethyI)cyclohexane, 
1 ,4-diaminocyclohexane and 1 ,4-bis{aminomethyl)bicyclo[2.2.2]octane. 

83. The conjugate of claim 77, wherein the linker is a l,ti/- 
diaminoalkane. 

84. The conjugate of claim 77, wherein the linker Is a 
20 1 ,3-diaminopropane. 

85. The conjugate of claim 77, wherein the linker is a ^,aJ- 
dicarbonylalkane. 

86. The conjugate of claim 77, wherein the linker is selected 
from oxalic, malonrc, succinic, glutaric, adtpic and pivallc acids. 

25 87. The conjugate of any of claims 1-30, wherein: 

the peptidic substrate comprises a PI -PI' bond; 
the PI -PI ' bond is the site of cleavage by a cell surface protease; 
PI is selected from Arg, Lys, Tyr, Phe, Trp, Ala, Val, He and Thr; 

and 
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PT is Gly, Ser, hSer, Thr, Ala, Leu, lie, d-lle, nLeu, Val, nVal, Aib, 
Abu, Met or 6-aminohexanoyl. 

88. The conjugate of any of claims 1-32, further comprising a P2 
residue selected from Phe, Ser, Gly, Ala, Ser(OI\/le), hSer, 1-methylHis, 3- 

5 methylHis, His, nVal, nLeu, Abu, (hS)G)y, Thr, Alb, CHA and Tyr. 

89. The conjugate of any of claims 1-33, further comprising a P3 
residue selected from Arg, Lys, Gin, Quat, Arg surrogates, Ser, Thr, hSer, 
dSer, Pro, {hS)Gly, Tyr, 4,4-dimethylThr, Asn, MetiOj), Quat^, Quat^, 
Quat* and Quat^. 

10 90. The conjugate of any of claims 1-34, further comprising a P4 

residue selected from Pro, Arg, Ser, Ala, Lys, Gly, nLeu, Leu, Tyr, GIu, 
Phe, Val, N,N-dimethylGly, )ff-Ala, Cys(Me), Gin, t-butylGly and nVal. 

91. The conjugate of any of claims 1-35, further comprising a P5 
residue selected from lie, Arg and Arg surrogates. 
15 92. The conjugate of any of claims 1-36, further comprising a P6 

residue selected from Val, Leu, Me and Val. 

93. The conjugate of any of claims 1-37, further comprising a 
P2' residue selected from Gly, Ser, Ala, Leu, lie, d-lle, nLeu, Vaf, nVal, 
Aib, Abu, Met, 6-aminohexanoyl, hCHA, CHA, hexylGly, allylGly and Phe. 
20 94. The conjugate of any of claims 1-38, further comprising a 

P3' residue selected from Gly, Ser, Ala, Leu, lie, nLeu, Val, nVal, Aib, 
Abu, Met, 6-aminohexanoyl, CHA and allylGly. 

95. The conjugate of any of claims 1-39, further comprising a 
P4' residue selected from Gly, Ser, Ala, Leu, He, nLeu, Val, nVal, Aib, 
25 Abu, Met, 6-aminohexanoyl, CHA and aliylGly. 

96- The conjugate of any of claims 1-39, wherein P4' is Gly, 
Ser, Ala, Leu, lie, nLeu, Val, nVal, Aib, Abu, Met and 6-aminohexanoyl. 

97. The conjugate of any of claims 1-39, wherein P4' is Leu. 

98. The conjugate of any of claims 1-39, wherein: 
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the peptidic substrate comprises a 5-mer that has the formula: 
P4-P3-P2-P1-Pr, wherein: 

PI is selected from among Arg, Lys, Tyr, Phe, Trp, Ala, Val, 

lie and Thr; 

5 P2 is selected from Phe, Ser, Gly, Ala, Ser(OMe), hSer, 1- 

methylHis, 3-methylHis, His, nVal, nLeu, Abu, (hS)Gly, Thr, Aib, CHA and 
Tyr; 

P3 is selected from Arg, Lys, Gin, Quat, Arg surrogates, Ser, 
Thr, hSer, dSer, Pro, (hS)Gly, Tyr, 4,4-dimethylThr, Asn, MetlOj), Quat^, 
10 Quat^, Quat* and Quat^; 

P4 is selected from Pro, Arg, Ser, Ala, Lys, Gly, nLeu, Leu, 
Tyr, Glu, Phe, Val, N,N-dimethylGly, $-A\a, Cys{IVle), Gin, t-butylGly and 
nVal; and 

PV is Gly, Ser, hSer, Thr, Ala, Leu, lie, d-lle, nLeu, Val, 
15 nVal, Aib, Abu, Met or 6-aminohexanoyl. 

99. The conjugate of claim 40, wherein: 

the peptidic substrate optionally further comprises one or more of a 
P5 or P2' amino actd residue, wherein: 

P5 is lie, Arg or an Arg surrogate; and 
20 P2' is selected from among Gly, Ser, Ala, Leu, lie, d-lle, nLeu, Val, 

nVal, Aib, Abu, Met, 6-aminohexanoyl, hCHA, CHA, hexylGly, allylGly 
and Phe. 

100. The conjugate of claim 41, wherein: 

if the peptidic substrate comprises a P5 amino acid residue, then 
25 the peptidic substrate optionally further comprises a P6 amino acid 
residue selected from Arg, Leu, lie and Val; and 

if the peptidic substrate comprises a P2' amino acid residue, then 
the peptidic substrate optionally further comprises a P3' amino acid 
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residue selected from Gly, Ser, Ala, Leu, He, nLeu, Val, nVal, Aib, Abu, 
Met, 6-aminohexanoyl, CHA and allylGly; and 

if the peptidic substrate comprises a P3' amino acid residue, then 
the peptidic substrate optionally further comprises a P4' amino acid 
5 residue selected from Gly, Ser, Ala, Leu, lie, nLeu, Val, nVal, Aib, Abu, 
Met, 6-aminohexanoyL CHA and allylGly. 

101 . The conjugate of any of claims 43-47 that has formula IV: 
X"-(P6L-(P5)p-(P4)HP3V(P2)rP1-(Pr),-(P2'),-(P3'),-{P4'),-(L)n-Z 
or a derivative thereof, wherein: 
10 Z is a therapeutic agent; 

L is a linker; 

1, j, i, p and m are selected as follows: 

I is 0 or 1 ; when I is 0, j, i, p and m are 0; when I is 1 , j is 0 or 1 ; 
when j is 0, i, p and m are 0; when j is 1 , i is 0 or 1 ; when i is 0, p and m 
15 are 0; when i is 1 , p is 0 or 1 ; when p is 0, m is 0; when p is 1 , m is 0 or 
1; 

u, k, r and s are selected as follows: 

u is 0 or 1 ; when u is 0, k, r and s are 0; when u is 1 , k is 0 or 1 ; 
when k is 0, r and s are 0; when k is 1 , r is 0 or 1 ; when r is 0, s is 0; 
20 when r is 1 , s is 0 or 1 ; 
n is 0 or 1 ; 

X" is hydrogen, or an acyl, sulfonyl or carbamoyl cap; 

PI is selected from among Arg, Lys, Tyr, Phe, Trp, Ala, Val, He and 

Thr; 

25 PI ' is Gly, Ser, Ala, Leu, tie, d-lle, nLeu, Val, nVal, Aib, Abu, Met 

or 6-aminohexanoyl; 

P2 is selected from Phe, Ser, Gly and Ala; 

P3 is selected from Arg, Lys, Gin, Ser, Quat and Arg surrogates; 
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P4 is selected from Pro, Arg, Ser, Ala, Lys, Gly, nLeu, Leu, Tyr, 
Glu, Phe and Val; 

P5 is selected from Arg and Arg surrogates; 

P6 is selected from Leu, Me and Val; 
5 P2' is selected from Gly, Ser, Ala, Leu, lie, d-Ue, nLeu, Val, nVal, 

Aib, Abu, Met and 6-amtnohexanoyl; 

P3' is selected from Gly, Ser, Ala, Leu, lie, nLeu, Val, nVal, Aib, 
Abu, Met and 6-aminohexanoyl; and 

P4' is selected from Gly, Ser, Ala, Leu, lie, nLeu, Val, nVal, Aib, 
10 Abu, Met and 6-aminohexanoyl. 

102. The conjugate of claim 50 or claim 51 that has formula V: 
Z-iU„-(P6)„-(P5)p-(P4y,-{P3)j-(P2)rP1-(PV)u-(P2')k-(P3'),-{P4'),-X'' 
or a derivative thereof, wherein: 

Z is a therapeutic agent; 
15 L is a linker; 

I, j, i, p and m are selected as follows: 

I is 0 or 1 ; when I is 0, J, i, p and m are 0; when 1 is 1 , j is O or 1 ; 
when j is 0, i, p and m are 0; when j is 1 , i is 0 or 1 ; when i is 0, p and m 
are 0; when r is 1 , p is 0 or 1 ; when p is 0, m is 0; when p is 1 , m is 0 or 
20 1; 

u, k, r and s are selected as follows: 

u is 0 or 1 ; when u is 0, k, r and s are 0; when u is 1 , k is 0 or 1 ; 
when k is 0, r and s are 0; when k is 1 , r is 0 or 1 ; when r is O, s is O; 
when r is 1 , s is 0 or 1 ; 
25 n is 0 or 1 ; 

X'^, together with the carbonyl group of the amino acid residue to 
which it is attached, forms a carboxylic acid or a carboxamide group; 

PI is selected from among Arg, Lys, Tyr, Phe, Trp, Ala, Val, He and 

Thr; 
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PV is Gly, Ser, Ala, Leu, lie, d-lie, nUeu, Val, nVal, Aib, Abu, Met 
or 6-aminohexanoyl; 

P2 is selected from Phe, Ser, Gly and A!a; 

P3 is selected from Arg, Lys, Gin, Ser, Quat and Arg surrogates; 
5 P4 is selected from Pro, Arg, Ser, Ala, Lys, Gly, nLeu, Leu, Tyr, 

Glu, Phe and Val; 

P5 is selected from Arg and Arg surrogates; 
P6 is selected from Leu, lie and Val; 

P2' is selected from Gly, Ser, Ala, Leu, lie, d-)Ie, nLeu, Val, nVal, 
10 Aib, Abu, Met and 6-aminohexanoyl; 

P3' is selected from Gly, Ser, Ala, Leu, lie, nLeu, Val, nVal, Aib, 
Abu, Met and 6-aminohexanoyl; and 

P4' is selected from Gly, Ser, Ala, Leu, He, nLeu, Val, nVal, Aib, 
Abu, Met and 6-aminohexanoyl. 
15 103. The conjugate of claim 54 that has formula VI: 

Z^-(L^),-(P6)^-{P5)p-(P4)r{P3)r(P2)rP1-(Pr)u-(P2')k-(P3'K-{P4'),-{L2),-Z2 
or a derivative thereof, wherein: 

7} and l} are each therapeutic agents and are the same or 
different; 

20 L^ and \} are each linkers and are the same or different; 

I, j, i, p and m are selected as follows: 

1 is O or 1 ; when I is 0, j, i, p and m are 0; when lis 1 , j is 0 or 1 ; 
when j is 0, i, p and m are 0; when j is 1 , i is 0 or 1 ; when i is 0, p and m 
are 0; when i is 1 , p is 0 or 1 ; when p is 0, m is 0; when p is 1 , m is 0 or 
25 1; 

u, k, r and s are selected as follows: 

u is O or 1 ; when u Is 0, k, r and s are 0; when u is 1 , k is 0 or 1 ; 
when k is 0, r and s are 0; when k is 1 , r is 0 or 1 ; when r is 0, s is 0; 
when r is 1 , s is 0 or 1 ; 
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n and v are each independently 0 or 1 ; 

PI is selected from among Arg, Lys, Tyr, Phe, Trp, Ala, Val, lie and 

Thr; 

Pr is Gly, Ser, Ala, Leu, lie, d-lle, nLeu, Val, nVal, Aib, Abu, Met 
5 or 6-aminohexanoyl; 

P2 is selected from Phe, Ser, Gly and Ala; 

P3 is selected from Arg, Lys, Gin, Ser, Quat and Arg surrogates; 
P4 Is selected from Pro, Arg, Ser, Ala, Lys, Gly, nLeu, Leu, Tyr, 
Glu, Phe and Val; 
10 P5 is selected from Arg and Arg surrogates; 

P6 is selected from Leu, lie and Val; 

P2' is selected from Gly, Ser, Ala, Leu, Me, d-lle, nLeu, Val, nVat, 
Aib, Abu, Met and 6-aminohBxanoyl; 

P3' is selected from Gly, Ser, Ala, Leu, Me, nLeu, Val, nVal, Aib, 
15 Abu, Met and 6-aminohexanoyl; and 

P4' is selected from Gly, Ser, Ala, Leu, lie, nLeu, Val, nVal, Aib, 
Abu, Met and 6-aminohexanoyl. 

104. The conjugate of any of claims 1-49, selected from: 
Ac-R-Q-G-R-S-L-(therapeutic agent) (SEQ ID NO: 491); 
20 Ac-R-Q-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 492); 
Ac-R-Q-G-R-S-nL-(therapeutic agent) {SEQ ID NO: 493); 
Ac-R-Q-G-R-S-nV-(therapeutic agent) (SEQ ID NO: 494); 
Ac-R-Q-G-R-S-F-(therapeutic agent) (SEQ ID NO: 495); 
Ac-R-Q-G-R-A-L-(therapeutic agent) (SEQ ID NO: 496); 
25 Ac-R-Q-G-R-A-L-(therapeutlc agent) (SEQ ID NO: 497); 
Ac-R-Q-G-R-A-nL-(therapeutic agent) (SEQ ID NO: 498); 
Ac-R-Q-G-R-A-nL-(therapeutic agent) (SEQ ID NO: 499); 
Ac-R-Q-G-R-A-nV-(therapeutic agent) (SEQ ID NO: 500); 
Ac-R-Q-G-R-A-Cha-(therapeutlc agent) (SEQ ID NO: 501); 
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Ac-R-Q-G-R-A-F-(therapeutic agent) (SEQ ID NO: 502); 

Ac-R-N-G-R-S-L-(therapeutic agent) (SEQ ID NO: 503); 

Ac-R-N-G-R-A-nL-(therapeutic agent) (SEQ ID NO: 504); 

Ac-R-a-A-R-S-L-(therapeutic agent) (SEQ ID NO: 505); 
5 Ac-R-Q-A-R-S-nL-(therapeutic agent) (SEQ ID NO: 506); 

Ac-R-Q-A-R-S-nV-(therapeutic agent) (SEQ ID NO: 507); 

Ac-R-Q-A-A-S-Cha-(therapeutic agent). {SEQ ID NO: 508); 

Ac-R-Q-A-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 509); 

Ac-R-Q-A-R-T-nL-(therapeutic agent) (SEQ ID NO: 510); 
10 Ac-R-Q-A-R-A-L-(therapeutic agent) (SEQ ID NO: 51 1); 

Ac-R-Q-A-R-A-nL-(therapeutic agent) (SEQ ID NO: 512); 

Ac-R-Q-A-R-A-nV-(therapeutic agent) (SEQ ID NO: 513); 

Ac-R-Q-A-R-A-Cha-(therapeutic agent) (SEQ ID NO: 514); 

Ac-R-Q-S-R-A-A-(therapeutic agent) (SEQ ID NO: 515); 
15 Ac-R-Q-S-R-A-(therapeutic agent) (SEQ ID NO: 516); 

Ac-R-Q-S-R-A-nL-(therapeutic agent) (SEQ ID NO: 517); 

Ac-RrQ-S-R-A-L-{therapeutlc agent) (SEQ ID NO: 518); 

Ac-R-Q-S-R-A-nV-(therapeutic agent) (SEQ ID NO: 519); 

Ac-R-Q-S-R-A-Cha-(therapeutic agent) (SEQ ID NO: 520); 
20 Ac-R-Q-S-R-S-S-L- (therapeutic agent) (SEQ ID NO: 521); 

Ac-R-Q-S-R-S-L-(therapeutic agent) (SEQ ID NO: 522); 

Ac-R-Q-S-R-S-nL-(therapeutlc agent) (SEQ ID NO: 523); 

Ac-R-Q-S-R-S-nL-(therapeutic agent) (SEQ ID NO: 524); 

Ac-R-Q-S-R-S-nV-(therapeutlc agent) (SEQ ID NO: 525); 
25 Ac-R-Q-S-R-S-aIlylG-(therapeutic agent) (SEQ ID NO: 526); 

Ac-R-Q-S-R-S-Cha-(therapeutic agent) (SEQ ID NO: 527); 

Ac-R-Q-S-R-T-nL-(therapeutic agent) (SEQ ID NO: 528); 

Ac-R-Q-T-R-S-S-L-(therapeutic agent) (SEQ ID NO: 529); 

Ac-R-Q-T-R-S-L-(therapeutic agent) (SEQ ID NO: 530); 
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Ac-R-N-S-R-S-nL-(therapeutic agent) (SEQ ID NO: 531); 

Ac-R-Q-F-R-S-L-(therapeuttc agent) (SEQ ID NO: 532); 

Ac-R-Q-F-R-S-nL-(therapeutic agent) (SEQ ID NO: 534); 

Ac-R-Q-F-R-S-nV-(therapeutic agent) (SEQ ID NO: 535); 
5 Ac-R-Q-F-R-S-nL-{therapeutic agent) (SEQ ID NO: 536); 

Ac-R-Q-F-R-S-Cha-(therapeut(C agent) (SEQ ID NO: 537); 

Ac-R-Q-F-R-A-L-(therapeutic agent) (SEQ ID NO: 538); 

Ac-R-Q-F-R-A-nL-(therapeutic agent) (SEQ ID NO: 539); 

Ac-R-Q-F-R-A-nV-(therapeutlc agent) (SEQ ID NO: 540); 
10 Ac-R-Q-F-R-A-Cha-(therapeutic agent) (SEQ ID NO: 541); 

Ac-Q-S-R-S-S-nL-(therapeutlc agent) (SEQ ID NO: 542); 

MeOCO-Quat2-G-R-S-L-(therapeutic agent) (SEQ ID NO: 483); 

MeOCO-Quat3-G-R-S-L-(therapeutic agent) (SEQ ID NO: 484); 

MeOCO-Quat-G-R-S-L-(therapeutic agent) (SEQ ID NO: 485); 
15 MeOCO-Quat4-G-R-S-L-(therapeutic agent) (SEQ ID NO: 486); 

MeOCO-Quat5-G-R-S-L-(therapeutic agent) (SEQ ID NO: 487); 

MeOCO-Quat2-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 488); 

Me0C0-Quat4-G-R-S-L-{therapeutic agent) (SEQ ID NO: 489); 

MeOCO-Quat2-G-R-S-L-(therapeutic agent) (SEQ ID NO: 490); 
20 Ac-Q-G-R-S-L-(therapeutic agent). (SEQ ID NO: 445); 

Ac-Q-G-R-S-S-L-{therapeutic agent) (SEQ ID NO: 446); 

Ac-Q-G-R-A-S-L-(therapeutic agent) (SEQ ID NO: 447); 

Ac-N-G-R-S-S-L-(therapeutlc agent) (SEQ ID NO: 448); 

Ac-Q-G-R-S-S-nL-(therapeutic agent) (SEQ ID NO: 449); 
25 Ac-Q-G-R-S-S-nV-(therapeutic agent) (SEQ ID NO: 450); 

Ac-Q-G-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 451); 

Ac-Q-G-R-S-S-allylG-(therapeutic agent) (SEQ ID NO: 452); 

Ac-Q-G-R-S-S-allylG-(therapeutic agent) (SEQ ID NO: 453); 

Ac-Q-A-R-S-L-(therapeutic agent) (SEQ ID NO: 454); 
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Ac-Q-A-R-S-S-L-(therapeutic agent) (SEQ ID NO: 455); 

Ac-Q-S-R-S-L-(therapeutic agent) (SEQ ID NO: 456); 

Ac-Q-S-R-S-S-nV-{therapeutic agent) (SEQ ID NO: 457); 

Ac-Q-S-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 458); 
5 Ac-a-S-R-S-S-L-(therapeutic agent) (SEQ ID NO: 459); 

Ac-Q-T-R-S-S-L-(therapeutic agent) (SEQ ID NO: 460); 

Ac-Q-Aib-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 461 ); 

Ac-Q-Aib -R-S-S-L-(therapeutic agent) (SEQ ID NO: 462); 

Ac-Q-Abu-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 463); 
10 Ac-Q-Abu-R-S-S-L-(therapeutic agent) (SEQ ID NO: 464); 

Ac-Q-Cha-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 465); 

Ac-Q-F-R-S-L-(therapeutic agent) (SEQ ID NO: 466); 

Ac-Q-F-R-S-S-L-(therapeutic agent) (SEQ ID NO: 467); 

Ac-Q-Y-R-S-S-L-(therapeutic agent) (SEQ ID NO: 468); 
15 Ac-R-G-R-S-L-(therapeutic agent) (SEQ ID NO: 469); 

Ac-R-G-R-S-S-L-(therapeutic agent} (SEQ ID NO: 470); 

Ac-R-G-R-S-S-Cha-(therapeutlc agent) (SEQ ID NO: 471); 

Ac-R-G-R-S-Cha-(therapeutic agent) (SEQ ID NO: 472); 

Ac-R-A-R-S-L-(therapeutic agent) (SEQ ID NO: 473); 
20 Ac-R-A-R-S-S-L-(therapeutic agent) (SEQ ID NO: 474); 

Ac-R-S-R-S-L-(therapeutic agent) (SEQ ID NO: 475); 

Ac-R-S-R-S-S-L-(therapeutic agent) (SEQ ID NO: 476); 

Ac-R-S-R-S-Cha-(therapeutic agent) (SEQ ID NO: 477); 

Ac-R-S-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 478); 
25 Ac-R-F-R-S-L-{therapeutic agent) (SEQ ID NO: 479); 

Ac-R-F-R-S-Cha-(therapeutic agent) (SEQ ID NO: 480); 

Ac-Y-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 481); 

Ac-M(02)-S-R-S-L-(therapeutic agent) (SEQ ID NO: 482); 

Ac-R-R-Q-S-R-A-A-(therapeutic agent) (SEQ ID NO: 105); 
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Ac-R-R-Q-S-R-l-(therapeutic agent) (SEQ ID NO: 610); 

Ac-R-R-Q-S-R-S-S-L-(therapeutic agent) |SEQ ID NO: 543); 

Ac-R-R-Q-S-R-S-L-(therapeutic agent) (SEQ ID NO: 544); 

Ac-R-G-S-G-R-S-L-(therapeutic agent) (SEQ ID NO: 545); 
5 Ac-R-G-S-G-R--S-nL-(therapeutic agent) (SEQ ID NO: 546); 

Ac-R-G-S-G-R-A-nL-(therapeutic agent) (SEQ ID NO: 547); 

Ac-R-G-S-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 548); 

Ac-l-V-S-G-R-A-S-L-(therapeutic agent) (SEQ ID NO: 549); 

Ac-R-R-Q-S-R-A-(therapeutic agent) (SEQ ID NO: 108); 
10 Ac-R-R-Q-S-R-l-(therapeutic agent) (SEQ ID NO: 111); 

Ac-L-R-R-Q-S-R-A-A-(therapeutic agent) (SEQ ID NO: 106); 

Ac-L-R-R-Q-S-R-G-G-(therapeutic agent) (SEQ ID NO: 109); 

Ac-L-R-R-Q-S-R-A-(therapeutic agent) (SEQ ID NO: 110); 

Ac-L-R-R-Q-S-R-A-I-{therapeutlc agent) (SEQ ID NO: 112); 
15 Ac-L-R-R-Q-S-R-A-I- (therapeutic agent) (SEQ ID NO: 61 1); 

Ac-L-R-R-Q-S-R-S-S-L-(therapeutic agent) (SEQ ID NO: 550); 

Ac-L-R-R-Q-S-R-S-L-(therapeutlc agent) (SEQ ID NO: 551); 

Ac-S-G-R-S-L-(therapeutic agent) (SEQ ID NO: 362); 

Ac-S-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 363); 
20 Ac-S-G-R-S-S-S-L-(therapeutic agent) (SEQ ID NO: 364); 

Ac-S-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 365); 

Ac-S-G-R-S-nV-(therapeutic agent) (SEQ ID NO: 366); isomer 1 

Ac-S-G-R-S-nV-(therapeutic agent) (SEQ ID NO: 367); isomer 2 

Ac-S-G-R-S-G(hex)-(therapeutic agent) (SEQ ID NO: 368); 
25 Ac-S-G-R-S-Cha-(therapeutic agent) (SEQ ID NO: 369); 

Ac-S-G-R-S-hCha-(therapeutic agent) (SEQ ID NO: 370); 

Ac-S-A-R-S-L-(therapeutic agent) (SEQ ID NO: 371); 

Ac-S-A-R-S-S-L-(therapeutic agent) (SEQ ID NO: 372); 

Ac-S-S-R-S-nL-(therapeutic agent) (SEQ ID NO: 373); 
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Ac-T-G-R-S-Abu-(therapeutic agent) (SEQ ID NO: 374); 

Ac-T-G-R-S-L-(therapeutic agent) {SEQ ID NO: 375); 

Ac-T-G-R-S-nV-(therapeutic agent) (SEQ ID NO: 376); 

Ac-T-G-R-S-nL-(therapeutic agent) {SEQ ID NO: 377); 
5 Ac-T-G-R-S-G(hex)-(therapeutic agent) {SEQ ID NO: 378); 

Ac-T-G-R-S-Cha-(therapeutic agent) (SEQ ID NO: 379); 

Ac-T-G-R-S-hCha-(therapeutic agent) (SEQ ID NO: 380); 

Ac-T-G-R-T-Abu-{therapeutic agent) (SEQ ID NO: 381); 

Ac-T-G-R-hS-nL-(therapeutic agent) (SEQ ID NO: 382); 
10 Ac-T-G-R-Abu-nL-{therapeutic agent) (SEQ ID NO: 383); 

Ac-T-G-R-Abu-nV-(therapeutic agent) {SEQ ID NO: 384); 

Ac-T-G-F{Gn)-S-nL-(therapeutic agent) {SEQ ID NO: 385); 

Ac-T-G-F(Gn)-S-Cha-(therapeutic agent) {SEQ ID NO: 386); 

Ac-T-G-F(Gn)-Abu-nV-(therapeutic agent) (SEQ ID NO: 387); 
15 Ac-T-G-K{a!loc)-S-nL-(therapeutic agent) (SEQ ID NO: 388); 

Ac-T-G-K-S-nL-{therapeutic agent) {SEQ ID NO: 389); 

Ac-T-G-hR-S-nL-(therapeutic agent) {SEQ ID NO: 390); 

Ac-(hS)G-G-R-S-nL-{therapeutic agent) {SEQ ID NO: 391); 

MeOCO-T-G-R-S-nL-(therapeutlc agent) (SEQ ID NO: 392); 
20 PhS02-T-G-R-S-nL-(therapeutlc agent) (SEQ ID NO: 393); 

MeOEtCO-T-G-R-S-nL-(therapeutic agent) {SEQ ID NO: 394); 

MeO{EtO)2Ac-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 395); 

4-oxo-Pentanoyl-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 396); 

3,4-MethyldioxyPhAc-T-G-R-S-nL-{therapeutic agent) (SEQ ID NO: 397); 
25 2-PyridylAc-T-G-R-S-nL-{therapeutic agent) {SEQ ID NO: 398); 

PhOAc-T-G-R-S-nL-(therapeutlc agent) (SEQ ID NO: 399); 

L-3-PhLactyl-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 400); 

MeOAc-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 401); 

PhAc-T-G-R-S-nL-(therapeutic agent) {SEQ ID NO: 402); 
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MeOEtOCO-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 403); 

MeOEtOAc-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 404); 

HOOCButa-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 405); 

Z-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 406); 
5 EtOCO-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 407); 

^A-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 408); 

Pent-4-ynoyl-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 409); 

NapAc-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 410); 

iBoc-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 411); 
10 HOAc-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 41 2); 

MeSucc-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 413); 

N,N-diMeGIv-T-G-R-S-nL-(therapeutlc agent) (SEQ ID NO: 414); 

Succ-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 415); 

HCO-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 416); 
15 Ac-T-A-R-S-nL-(therapeutic agent) (SEQ ID NO: 417); 

Ac-T-A-F(Gn)-S-nL-(therapeutic agent) (SEQ ID NO: 418); 

Ac-T-A-R-Abu-nV-(therapeutic agent) (SEQ ID NO: 419); 

Ac-T-A-R-S-Abu-(therapeutic agent) (SEQ ID NO: 420); 

Ac-T-A-R-T-Abu-(therapeutic agent) (SEQ ID NO: 421); 
20 Ac-T-S(0-Me)-R-S-nL-(therapeutic agent) (SEQ ID NO: 422); 

Ac-T-hS-R-S-nL-(therapeuttc agent) (SEQ ID NO: 423); 

Ac-T-(1-Me)H-R-S-nL-(therapeutic agent) (SEQ ID NO: 424); 

Ac-T-(3-Me)H-R-S-nL-(therapeutic agent) (SEQ ID NO: 425); 

Ac-T-H-R-S-nL-(therapeutic agent) (SEQ ID NO: 426); 
25 Ac-T-Sar-R-S-nL-(therapeutic agent) (SEQ ID NO: 427); 

Ac-T-nV-R-S-nL-(therapeutic agent) (SEQ ID NO: 428); 

Ac-T-nL-R-S-nL-(therapeutic agent) (SEQ ID NO: 429); 

Ac-T-A-R-S-Cha-(therapeutic agent) (SEQ ID NO: 430); 

Ac-T-Abu-R-S-nL-(therapeutic agent) (SEQ ID NO: 431); 
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Ac-4,4diMeThr-G-R-S-nL-(therapeutic agent) {SEQ ID NO: 432); 

Ac-hS-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 433); 

Ac-hS-G-R-hS-Cha-{therapeutic agent) (SEQ ID NO: 434); 

Ac-hS-G-R-S-Cha-(therapeutic agent) (SEQ ID NO: 435); 
5 Ac-hS-G-R-T-Cha-(therapeutic agent) {SEQ ID NO: 436); 

Ac-hS-A-R-S-Cha-(therapeutic agent) (SEQ ID NO: 437); 

Ac-N-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 438); 

Ac-Y-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 439); 

Ac-Y-G-R-S-Cha-(therapeutic agent) (SEQ ID NO: 440); 
10 Ac-Q-G-R-S-S-nL-(therapeutic agent) (SEQ ID NO: 441); 

Ac-Q-G-R-S-S-nV-(therapeutic agent) (SEQ ID NO: 442); 

Ac-L-R-G-S-G-R-S-A-(therapeutic agent) (SEQ ID NO: 573); 

Ac-L-R-G-S-G-R-S-L-(therapeutic agent) {SEQ ID NO: 342); 

Ac-L-R-G-S-G-R-S-L-(therapeutic agent) {SEQ ID NO: 343); 
15 Ac-L-R-G-S-G-R-S-S-nL-{therapeutic agent) (SEQ ID NO: 344); 

Ac-L-R-G-S-G-R-S-S-Cha-(therapeutic agent) {SEQ ID NO: 345); 

Ac-L-R-G-dS-A-R-S-A-{therapeutic agent) {SEQ ID NO: 574); 

Ac-L-R-G-S-A-R-S-S-L-(therapeutic agent) (SEQ ID N0:346 ); 

Ac-L-R-G-S-A-R-S-L-(therapeutlc agent) (SEQ ID NO: 347); 
20 Ac-L-R-G-S-A-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 348); 

Ac-L-R-G-S-A-R-S-S-nV-{therapeutic agent) {SEQ ID NO: 349); 

Ac-L-R-G-S-A-R-S-S-nL-(therapeutic agent) (SEQ ID NO: 350); 

Ac-V-l-V-S-G-R-A-L-(therapeut!C agent) (SEQ ID NO: 351); 

Ac-V-l-V-S-A-R-S-L-(therapeutlc agent) (SEQ ID NO: 352); 
25 Ac-V-l-V-S-G-R-S-S-L-{therapeutic agent) (SEQ ID NO: 353); 

Ac-V-l-V-S-A-R-M-A-{therapeutic agent) (SEQ ID NO: 354); 

Ac-V-l-V-S-A-R-nL-A-(therapeutic agent) (SEQ ID NO: 355); 

Ac-V-l-V-S-A-R-S-nL-(therapeutic agent) (SEQ ID NO: 356); 

Ac-V-l-V-S-A-R-S-Cha-{therapeutlc agent) (SEQ ID NO: 357); 
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Ac-V-l-V-S-A-R-S-Cha-(therapeutic agent) (SEQ ID NO: 358); 

Ac-V-l-V-S-A-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 359); 

Ac-R-R-(Me)C-P-G-R-V-V-(therapeutic agent) (SEQ ID NO: 360); 

Ac-R-R-nV-P-A-R-S-L-(therapeutic agent) (SEQ ID NO: 361); 
5 Ac-R-G-dS-A-R-S-A-(therapeutic agent) (SEQ ID NO: 309); 

Ac-R-G-S-G-R-S-A-{therapeutic agent) (SEQ ID NO: 310); 

Ac-R-G-S-G-R-A-L-(therapeutic agent) (SEQ ID NO: 31 1); 

Ac-R-G-S-G-R-S-L-(therapeutic agent) (SEQ ID NO: 312); 

Ac-R-G-S-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 313); 
10 Ac-R-G-S-G-R-A-nL-(therapeutic agent) (SEQ ID NO: 314); 

Ac-R-G-S-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 315); 

Ac-R-G-S-G-R-S-Cha-{therapeutic agent) (SEQ ID NO: 316); 

Ac-R-G-S-G-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 317); 

Ac-R-G-S-A-R-S-Cha-(therapeutic agent) (SEQ ID NO: 318); 
15 Ac-R-G-S-A-R-S-S-(therapeutic agent) (SEQ ID NO: 319); 

Ac-R-G-S-A-R-S-nV-(therapeutlc agent) (SEQ ID NO: 320); 

Ac-R-G-S-A-R-S-S-nV -(therapeutic agent) (SEQ ID NO: 321); 

Ac-R-G-S-A-R-S-L-(therapeutic agent) (SEQ ID NO: 322); 

Ac-R-(Me)C-P-G-R-V-V-(therapeutic agent) (SEQ ID NO: 323); 
20 Ac-R-(Me)C-P-G-R-V-V-(therapeutic agent) (SEQ ID NO: 324); 

Ac-R-C(Me)-P-G-R-S-L-(therapeutic agent) (SEQ ID NO: 325); 

Ac-R-L-P-G-R-S-L-{therapeutic agent) (SEQ ID NO: 326); 

Ac-R-V-P-G-R-S-L-(therapeutic agent) (SEQ ID NO: 327); 

Ac-R-V-P-G-R-S-L-{therapeutic agent) (SEQ ID NO: 328); 
25 Ac-R-nL-P-G-R-S-L-(therapeutic agent) (SEQ ID NO: 329); 

Ac-R-G(tBu)-P-A-R-S-L-(therapeutic agent) (SEQ ID NO: 330); 

Ac-R-L-P-A-R-S-L-(therapeutic agent) (SEQ ID NO: 331); 

Ac-R-V-P-A-R-S-L-(therapeutrc agent) (SEQ ID NO: 332); 

Ac-R-nL-P-A-R-S-L-(therapeutic agent) (SEQ ID NO: 333); 
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Ac-I-V-S-G-R-A-L-(therapeutic agent) (SEQ ID NO: 334); 

Ac-l-V-S-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 335); 

Ac-l-V-S-G-R-A-S-L-(therapeutic agent) (SEQ ID NO: 336); 

Ac-l-V-S-A-R-M-A-(therapeutic agent) (SEQ ID NO: 337); 
5 Ac-l-V-S-A-R-nL-A-(therapeutic agent) (SEQ ID NO: 338); 

Ac-l-V-S-A-R-S-L-(therapeutic agent) (SEQ ID NO: 339); 

Ac-l-V-S-A-R-S-nL-(therapeutic agent) (SEQ ID NO: 340); 

Ac-l-V-S-A-R-S-S-L-(therapeutic agent) (SEQ ID NO: 341); 

Ac-G-S-G-R-S-A-(therapeuttc agent) (SEQ ID NO: 585); 
10 Ac-G-S-G-R-S-L-(therapeutic agent) (SEQ ID NO: 277); 

Ac-G-S-G-R-A-L-(therapeutic agent) (SEQ ID NO: 278); 

Ac-G-S-G-R-S-S-L-(therapeutic agent) (SEQ ID NO: 279); 

Ac-G-S-G-R-L-(therapeutfc agent) (SEQ ID NO: 280); 

Ac-G-S-G-(4-guan)Phg-S-L-(therapeutic agent) (SEQ ID NO: 281); 
15 Ac-G-S-G-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 282); 

Ac-G-S-G-R-A-S-L-(therapeutic agent) (SEQ ID NO: 283); 

Ac-G-S-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 284); 

Ac-G-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 285); 

Succ-bA-T-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 286); 
20 Ac-G-T-G-R-S-hCha-(therapeutic agent) (SEQ tD NO: 287); 

Ac-G-hS-G-R-S-nL-(therapeutic agent) (SEQ ID NO: 288); 

Ac-G-dS-A-R-S-A-(therapeutic agent) (SEQ ID NO: 289); 

Ac-G-S-A-R-S-L-(therapeutic agent) (SEQ ID NO: 290); 

Ac-G-S-A-R-S-S-Cha-(therapeutic agent) (SEQ ID NO: 291); 
25 Ac-G-S-A-R-S-S-L-(therapeutic agent) (SEQ ID NO: 292); 

Ac-G-S-A-R-A-S-L-(therapeutic agent) (SEQ ID NO: 293); 

Ac-V-S-G-R-S-L-(therapeutlc agent) (SEQ ID NO: 294); 

Ac-V-S-G-R-A-L-(therapeutic agent) (SEQ ID NO: 295); 

Ac-V-S-G-R-A-S-L-(therapeutic agent) (SEQ ID NO: 296); 
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Ac-V-S-G-R-S-S-L-(therapeutic agent) {SEQ ID NO: 297); 

Ac-V-S-A-R-M-A-{therapeutic agent) (SEQ ID NO: 298); 

Ac-V-S-A-R-nL-A-(therapeutic agent) (SEQ ID NO: 299); 

Ac-V-S-A-R-S-nL-(therapeutic agent) (SEQ ID NO: 300); 
5 Ac-V-S-A-R-S-L-(therapeutic agent) (SEQ ID NO: 301); 

Ac-(Me)C-P-G-R-V-V-(therapeutic agent) (SEQ ID NO: 302); 

Ac-(Me)C-P-G-R-V-V-(therapeutic agent) (SEQ ID NO: 303); 

Ac-C(l\/le)-P-G-R-A-L- (therapeutic agent) (SEQ ID NO: 304); 

Ac-C(Me)-P-G-R-S-L-(therapeutic agent) (SEQ ID NO: 305); 
10 Ac-C(Me)-P-A-R-S-L-(therapeutic agent) (SEQ ID NO: 306); 

Ac-C(Me)-P-A-R-A-S-L-(therapeutlc agent) (SEQ ID NO: 307); 

Ac-G(tBu)-P-G-R-S-L-(therapeiJtic agent) (SEQ ID NO: 308); 

Ac-Q-S-R-A-A-(therapeutic agent) (SEQ ID NO: 552); 

Ac-Q-S-R-S-A-(therapeutic agent) (SEQ ID NO: 553); 
15 Ac-Q-S-R-S-G-(therapeutic agent) (SEQ ID NO: 554); 

Ac-R-S-R-A-A-(therapeutic agent) (SEQ ID NO: 555); 

Ac-R-Q-S-R-A-A-(therapeutic agent) (SEQ ID NO: 556); 

Ac-R-Q-S-R-S-A-(therapeutic agent) (SEQ ID NO: 557); 

Ac-R-Q-S-R-S-A-A-(therapeutic agent) (SEQ ID NO: 558); 
20 Ac-R-G-S-G-R-S-A-(therapeutic agent) (SEQ ID NO: 559); 

Ac-S-G-R-A-A-(therapeutic agent) (SEQ ID NO: 560); 

Ac-S-G-R-S-A-(therapeutic agent) (SEQ ID NO: 561); 

Ac-S-G-R-S-S-A-(therapeutic agent) (SEQ ID NO: 562); 

Ac-S-G-R-A-S-A-(therapeutic agent) (SEQ ID NO: 563); 
25 Ac-S-G-R-S-G-(therapeutic agent) (SEQ ID NO: 564); 

Ac-S-G-R-S-S-G-(therapeutic agent) (SEQ ID NO: 565); 

Ac-S-G-R-S-G-A-(therapeutic agent) (SEQ ID NO: 566); 

Ac-S-G-R-S-G-G-(therapeutic agent) (SEQ ID NO: 567); 

Ac-G-T-G-R-S-G-G-(therapeLJtic agent) (SEQ ID NO: 568); 



wo 02/095007 



PCT/US02/16819 



-272- 

Ac-G-S-G-R-S-G-G-(therapeutic agent) (SEQ ID NO: 243) 

Ac-L-R-R-Q-S-R-A-A-(therapeutic agent) (SEQ ID NO: 597); 

MeS02-dA{Chx)-Abu-R-S-L-(therapeutic agent) (SEQ ID NO: 598); 

Ac-R-A-R-S-L-(therapeutic agent) (SEQ ID NO: 599); 
5 Ac-dA(Chx)-Abu-R-S-L-{therapeutic agent) (SEQ ID NO: 600); 

Ac-dA(Chxl-Abu-R-S-S-L-(therapeutic agent) (SEQ ID NO: 601); 

Ac-a-G-R-S-S-L-{therapeutic agent) (SEQ ID NO: 602); 

Me0C0-dhF-P(0H)-R-S-S-L-(therapeutic agent) (SEQ ID NO: 603); 

Me0C0-Quat4-G-R-S-L-(therapeutic agent) (SEQ ID NO: 604); 
10 Ac-dCha-P(OH)-R-S-S-L-(therapeutic agent) (SEQ ID NO: 605); 

Ac-dCha-Abu-R-S-S-A-(therapeutic agent) (SEQ ID NO: 606); 

MeOCO-Quat2-G-R-S-L-(therapeutic agent) (SEQ ID NO: 607); 

MeOCO-Quat3-G-R-S-L-(therapeutic agent) (SEQ ID NO: 608); and 

MeOCO-Quat-G-R-S-L-{therapeutic agent) (SEQ ID NO: 609). 
15 105. The conjugate of any of claims 35-56, wherein P4 is 

selected from Pro, Arg, Ser, Ala, Lys, Gly, nLeu, Leu, Tyr, Glu, Phe, Val, 

N,N-dimethylGiy, >?-Ala, Cys(Me), Gin, t-butylGly and nVal. 

106. The conjugate of claim 1 or 66, that comprises a peptldic 
substrate of the formula P6-P5-P4-P3-P2-P1-Pr-P2'-P3'-P4', wherein 

20 each of Pi , P2, P3, P4, P5, P6, PI ' and P2' are selected from residues 
set forth in Figures 1 and 2, and P6, P5, P4, P2', P3' and P4' are 
optional. 

107. The conjugate of claim 67, wherein: 

P6 is optional and is selected from L, V, R; 
25 P5 is optional and is selected from R, I, L; 

P4 is optional and is selected from G, C, V; 
P3 is selected from S, dS, P, A or G; 
P2 is selected from A or G; 
PI is R; 
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Pr is S, V, M or nL; 

P2* is optional and is selected S, L, A or V; 
P3' is optional and is L; and 
P4' is optional and is L. 
5 108. A conjugate, comprising a therapeutic agent and a nucleic 

acid substrate linked thereto via a peptidic linker, wherein the peptidic 
linker is proteolytically cleaved by a cell surface protease or a soluble, 
released or shed form thereof, to liberate the therapeutic agent, wherein 
the conjugate is not substantially cleaved by plasmin or prostate specific 
10 antigen (PSA). 

109. The conjugate of claim 108, wherein the nucleic acid is 

DNA. 

110. The conjugate of claim 108, wherein the nucleic acid is 

RNA. 

15 111. The conjugate of claim 108, wherein the nucleic acid is 

double-stranded RNA. 

1 12. The conjugate of claim 67, wherein: 

P6 is optional and is selected from L, V, R; 
P5 is optional and is selected from R, I, L; 
20 P4 is optional and is selected from G, C, V; 

PS is selected from S, dS, P, A or G; 
P2 is selected from A or G; 
PI is R; 

PI ' is T, Abu, hS, nV or A; 
25 P2' is optional and is selected S, L, A or V; 

P3' is optional and is L, nL, nV, G(hex), G(allyl), CHA, hCHA, 

or Abu; and 

P4' is optional and is L, nL, nV, G{hex), G(allyl), CHA, hCHA, 

or Abu. 
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113. The conjugate of claim 67, wherein: 



P6 is optional and is selected from U V, R; 
P5 is optional and is selected from R, I, L; 
P4 is optional and is selected from G, C, V; 



5 



P3 



s selected from S, dS, P, A or G; 



P2 



is selected from A or G; 



PI 



s R; 



PV is S, G or A; 



10 



P2' is optional and is selected G or A; 

P3' is optional and is L, nU nV, G(hex}, G(allyO, CHA, hCHA, 



or Abu; and 



P4' is optional and is L, nU nV, G(hex), G(aliy!), CHA, hCHA, 



or Abu. 



114. The conjugate of any of claims 1-113, wherein the 



15 therapeutic agent is taxol. 

115. The conjugate of any of claims 1-113, wherein the 
therapeutic agent is doxorubicin. 

116. A method of treatment of a disease, comprising 
administering a conjugate of any of claims 1-1 13 to a subject, wherein 

20 the disease is a cell-surface protease-associated disease. 

117. The method of claim 116, wherein the disease is selected 
from the group consisting of autoimmune disesases, inflammatory 
diseases, infectious diseases and endocrine diseases. 



25 proliferative disease. 

119. A method of treatment of a cell-surface protease-associated 
disease, comprising administering a conjugate, comprising a therapeutic 
agent and a peptidic substrate linlced thereto optionally via a linker, 
wherein the peptidic substrate is proteolytically cleaved by a cell surface 



118. The method of claim 116, wherein the disease is a 
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protease or a soluble, released or shed form thereof to liberate the 
therapeutic agent, to a subject exhibiting symptoms of a cell-surface 
protease-associated disorder. 

1 20. The method of claim 119, wherein the disease is selected 
5 from the group consisting of autoimmune disesases, inflammatory 

diseases, infectious diseases and endocrine diseases. 

121 . The method of claim 1 19, wherein the disease is a 
proliferative disease. 

122. The method of any of claims 1 14-1 1 9, wherein the subject 
10 is a mammal. 

123. The method of claim 120, wherein the mammal is a human. 

124. The method of claim 118 or 121, wherein the disease is 
cancer. 

125. The method of claim 118 or 121, wherein the disease is 
15 selected from ocular disorders, cardiovascular disorders, chronic 

inflammatory diseases, wounds, circulatory disorders, dermatological 
disorders and cancer. 

1 26. The method of claim 118 or 121, wherein the disease is 
selected from rheumatoid arthritis, psoriasis, diabetic retinopathies, 

20 recurrence of pterygii, scarring from excimer laser surgery, scarring from 
glaucoma filtering surgery, macular degeneration anterior eye, crest 
syndromes, solid neoplasms and vascular tumors. 

127. The method of claim 118 or 121, wherein the disease is 
selected from lung cancer, colon cancer, pancreatic cancer, esophageal 

25 cancer, breast cancer, ovarian cancer, prostate cancer, melanoma and 
Kaposi's sarcoma. 

1 28. The method of any of claims 1 1 6-1 27, wherein the 
therapeutic agent is taxol. 
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129. The method of any of claims 116-127, wherein the 
therapeutic agent is doxorubicin. 

130. A pharmaceutical composition, comprising the conjugate of 
any of claims 1-113 or a pharmaceutically acceptable derivative thereof, 

5 in a pharmaceutically acceptable carrier. 

131. The pharmaceutical composition of claim 130 that is 
formulated for single dosage administration. 

132. An article of manufacture, comprising packaging material, 
the conjugate of any of claims 1-113, or a pharmaceutically acceptable 

10 derivative thereof, contained within packaging material, which is used for 
treatment, prevention or amelioration of one or more symptoms 
associated with cell-surface protease-associated diseases or disorders, 
and a label that indicates that the conjugate or pharmaceutically 
acceptable derivative thereof is used for treatment, prevention or 

15 amelioration of one or more symptoms associated with cell-surface 
protease-associated diseases or disorders. 

133. The conjugate of any of claims 1-113 when used for the 
treatment of a cell-surface protease-associated disease. 

134. The conjugate of claim 133, wherein the disease is a 
20 proliferative disease. 

135. The conjugate of claim 134, wherein the proliferative disease 
is cancer. 

1 36. The conjugate of claim 1 34, wherein the proliferative disease 
is selected from ocular diseases, cardiovascular diseases, chronic 

25 inflammatory diseases, wounds, circulatory diseases, dermatological 
diseases and cancer. 

137. The conjugate of claim 134, wherein the proliferative disease 
is selected from rheumatoid arthritis, psoriasis, diabetic retinopathies, 
recurrence of pterygii, scarring from excimer laser surgery, scarring from 
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glaucoma filtering surgery, macular degeneration anterior eye, crest 
syndronnes, solid neoplasms and vascular tumors. 

138. The conjugate of claim 134, wherein the proliferative disease 
is selected from lung cancer, colon cancer, pancreatic cancer, esophageal 

5 cancer, breast cancer, ovarian cancer, prostate cancer, melanoma and 
Kaposi's sarcoma. 

139. Use of the conjugate of any of claims 1-113 for the 
preparation of a medicament for use in the treatment of a cell-surface 
protease-associated disease. 

10 140. The use of claim 139, wherein the disease is a proliferative 

disease. 

141. The use of claim 140, wherein the proliferative disease is 
cancer. 

142. The use of claim 140, wherein the proliferative disease is 
15 selected from ocular diseases, cardiovascular diseases, chronic 

inflammatory diseases, wounds, circulatory diseases, dermatological 
diseases and cancer. 

143. The use of claim 140, wherein the proliferative disease is 
selected from rheumatoid arthritis, psoriasis, diabetic retinopathies, 

20 recurrence of pterygii, scarring from excimer laser surgery, scarring from 
glaucoma filtering surgery, macular degeneration anterior eye, crest 
syndromes, solid neoplasms and vascular tumors. 

144. The use of claim 140, wherein the proliferative disease is 
selected from lung cancer, colon cancer, pancreatic cancer, esophageal 

25 cancer, breast cancer, ovarian cancer, prostate cancer, melanoma and 
Kaposi's sarcoma. 

145. A method of preparing a conjugate of any of claims 1-113, 
comprising: 

a) synthesizing the peptidic substrate; 
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b) optionally capping the peptidic substrate on either the N- 
terminus or the C-terminus; 

c) optionally linking the non-capped terminus of the peptidic 
substrate to a linker; 

5 d) coupling the peptidic substrate to a therapeutic agent, 

optionally via the linker, to form a conjugate; and 

e) optionally, deprotecting the conjugate, if protected. 

146. The method of claim 145, wherein, prior to step a), the 
method comprises a step of identifying a peptidic substrate for the 

10 protease. 

147. A method, comprising: 

a) selecting a disease; 

b) identifying a cell involved in the disease process or a cell in the 
vicinity of the cell involved in the disease process; and 

15 c) identifying a cell surface protease on the cell, thereby 

identifying proteases to target conjugates for treatment of diseases. 

148. The method of claim 147, further comprising preparing a 
conjugate that targets the protease. 
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CONJUGATE 


MTSP1 CT50 


SEQ !D NO 


Ac-Y-G-R-S-S-L-Dox 


B 


481 


Ac-M(02)-S-R-S-L-Dox 


C 


482 


Ac-R-R-Q-S-R-A-A-Dox 


A 


105 


Ac-R-R-Q-S-R-I-Dox; isomer 1 


D 


610 


Ac-R-R-Q-S-R-S-S-L-Dox 


A 


543 


Ac-R-R-Q-S-R-S-L-Dox 


A 


544 


Ac-R-G-S-G-R-S-L-Dox 


B 


545 


Ac-R-G-S-G-R-S-nL-Dox 


A 


.546 


Ac-R-G-S-G-R-A-nL-Dox 


A 


547 


Ac-R-G-S-G-R-S-S-L-Dox 


B 


548 


Ac-I-V-S-G-R-A-S-L-Dox 


C 


549 


Ac-R-R-Q-S-R-A-Dox 


NT 


108 


Ac-R-R-Q-S-R-I-Dox; isomer 2 


NT 


111 


Ac-L-R-R-Q-S-R-A-A-Dox 


A 


106 


Ac-L-R-R-Q-S-R-G-G-Dox 


B 


109 


Ac-L-R-R-Q-S-R-A-Dox 


C 


110 


Ac-L-R-R-Q-S-R-A-I-Dox; isomer 1 


A 


112 


Ac-L-R-R-Q-S-R-A-I-Dox; isomer 2 


C 


611 


Ac-L-R-R-Q-S-R-S-S-I-Dox 


A 


550 


Ac-L-R-R-Q-S-R-S-L-Dox 


A 


551 



FIG. lA 
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CONJUGATE 


MTSP1 CT50 


SEQ ID NO 


Ac-Q-S-R-S-S-nV-Dox 


B 


457 


Ac-Q-S-R-S-S-Cha-Dox 


B 


458 


Ac-Q-S-R-S-S-L-Dox 


B 


459 


Ac-Q-T-R-S-S-L-Dox 


C 


460 


Ac-Q-Aib-R-S-S-Cha-Dox 


C 


461 


Ac-Q-Aib -R-S-S-L-Dox 


D 


462 


Ac-Q-Abu-R-S-S-Cha-Dox 


B 


463 


Ac-Q-Abu-R-S-S-L-Dox 


B 


464 


Ac-Q-Cha-R-S-S-Cha-Dox 


D 


465 


Ac-Q-F-R-S-L-Dox 


C 


466 


Ac-Q-F-R-S-S-L-Dox 


B 


467 


Ac-Q-Y-R-S-S-L-Dox 


C 


468 


Ac-R-G-R-S-L-Dox 


A 


469 


Ac-R-G-R-S-S-L-Dox 


A 


470 


Ac-R-G-R-S-S-Cha-Dox 


A 


471 


Ac-R-G-R-S-Cha-Dox 


A 


472 


Ac-R-A-R-S-L-Dox 


B 


473 


Ac-R-A- R-S-S-L- Dox 


A 




Ac-R-S-R-S-L-Dox 


B 


475 


Ac-R-S-R-S-S-L-Dox 


B 


476 


Ac-R-S-R-S-Cha-Dox 


B 


477 


Ac-R-S-R-S-S-Cha-Dox 


B 


478 


Ac-R-F-R-S-L-Dox 


B 


479 


AoR-F-R-S-Cha-Dox 


B 


480 



FIG. IB 
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CONJUGATE MTSP1 CT50 SEQ ID NO 



Ac-R-Q-F-R-A-nV-Dox 


B 


540 


Ac-R-Q-F-R-A-Cha-Dox 


D 


541 


Ac-Q-S-R-S-S-nL-Dox 


B 


542 


MeOCO-Quat2-G-R-S-L-NH2 


B 


483 


l\/leOCQ-Quat3-G-R-STL-N H2 


B 


484 


MeOCO-Quat-G-R-S-L-NH2 


C 


485 


MeOCO-Quat4-G-R-S-L-N H2 


B 


486 


MeOCO-Quat5-G-R-S-L-NH2 


C 


487 


MeOCO-Quat2-G-R-S-S-L-NH2 


B 


488 


MeOCO-Quat4-G-R-S-L-Dox 


B 


489 


MeOCO-Quat2-G-R-S-L-Dox 


B 


490 


Ac-Q-G-R-S-L-Dox 


C 


445 


Ac-Q-G-R-S-S-L-Dox 


B 


446 


Ac-Q-G-R-A-S-L-Dox 


B 


447 


Ac-N-G-R-S-S-L-Dox 


C 


448 


Ac-Q-G-R-S-S-nL-Dox 


B 


449 


Ac-O-G-R-S-S-nV-Dox 




450 


Ac-Q-G-R-S-S-Cha-Dox 


B 


451 


Ac-Q-G-R-S-S-ailylG-Dox; isomer 1 


B 


452 


Ac-Q-G-R-S-S-allylG-Dox; isomer 2 


B 


453 


Ac-Q-A-R-S-L-Dox 


C 


454 


Ac-Q-A-R-S-S-L-Dox 


B 


455 


Ac-Q-S-R-S-L-Dox 


C 


456 



FIG. IC 
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CONJUGATE 


WITSP1 CT50 


SEQ ID NO 


Ac-R-Q-S-R-A-A-Dox 


A 


515 


Ac-R-Q-S-R-A-Dox 


D 


516 


Ac-R-Q-S-R-A-nL-Dox 


A 


517 


Ac-R-Q-S-R-A-L-Dox 


A 


519 


Ac-R-Q-S-R-A-nV-Dox 


A 


520 


Ac-R-Q-S-R-A-Cha-Dox 


B 


521 


Ac-R-Q-S-R-S-S-L-Dox 


A 


522 


Ac-R-Q-S-R-S-L-Dox 


A 


523 


Ac-R-Q-S-R-S-dnL-Dox 


A 


524 


Ac-R-Q-S-R-S-dnL-Dox 


C 


525 


Ac-R-Q-S-R-S-nV-Dox 


A 


526 


Ac-R-Q-S-R-S-allylG-Dox 


A 


527 


Ac-R-Q-S-R-S-Cha-Dox 


B 


528 


Ac-R-Q-S-R-T-nL-Dox 


A 


529 


Ac-R-Q-T-R-S-S-L-Dox 


B 


530 


Ac-R-Q-T-R-S-L-Dox 


B 


531 


Ac-R-N-S-R-S-nL-Dox 


B 


532 


Ac-R-Q-F-R-S-L-Dox 


B 


533 


Ac-R-Q-F-R-S-nL-Dox 


A 


534 


Ac-R-Q-F-R-S-nV-Dox 


A 


535 


Ac-R-Q-F-R-S-nL-Dox 


D 


536 


Ac-R-Q-F-R-S-Cha-Dox 


D 


537 


Ac-R-Q-F-R-A-L-Dox 


C 


538 


Ac-R-Q-F-R-A-nL-Dox 


C 


539 



FIG. ID 
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CONJUGATE 


WITSP1 CT50 


SEQ ID NO 


Ac-R-Q-G-R-S-L-Dox 


A 


491 


Ac-R-Q-G-R-S-S-L-Dox 


A 


492 


Ac-R-Q-G-R-S-nL-Dox 


A 


493 


Ac-R-Q-G-R-S-nV-Dox 


A 


494 


Ac-R-Q-G-R-S-F-Dox 


A 


495 


Ac-R-Q-G-R-A-L-Dox 


A 


496 


Ac-R-Q-G-R-A-dL-Dox 


D 


497 


Ac-R-Q-G-R-A-dnL-Dox 


A 


498 


Ac-R-Q-G-R-A-nL-Dox 


B 


499 


Ac-R-Q-G-R-A-nV-Dox 


A 


500 


Ac-R-Q-G-R-A-C ha-Dox 


A 


501 


Ac-R-Q-G-R-A-F-Dox 


D 


502 


Ac-R-N-G-R-S-L-Dox 


B 


503 


Ac-R-N-G-R-A-nL-Dox 


A 


504 


Ac-R-Q-A-R-S-L-Dox 


B 


505 


Ac-R-Q-A-R-S-nL-Dox 


A 


506 


Ac-R-Q-A-R-S-nV-Dox 


A 


507 


Ac-R-Q-A-A-S-Cha-Dox 


B 


508 


Ac-R-Q-A-R-S-S-Cha-Dox 


A 


509 


Ac-R-Q-A-R-T-nL-Dox 


A 


510 


Ac-R-Q-A-R-A-L-Dox 


D 


511 


Ac-R-Q-A-R-A-nL-Dox 


A 


512 


Ac-R-Q-A-R-A-nV-Dox 


B 


513 


Ac-R-Q-A-R-A-Cha-Dox 


B 


514 



FIG. IE 
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CONJUGATE 


■ ■DA ^TCft 

UPA CT50 


OCU lu NO 


Ac-S-G-R-S-L-Dox 


C 


362 


Ac-S-G-R-S-S-L-Dox 


c 


363 


Ac-S-G-R-S-S-S-L-Dox 


D 


364 


Ac-S-G-R-S-nL-Dox 


B 


365 


Ac-S-G-R-S-nV-Dox; isomer 1 


B 


366 


Ac-S-G-R-S-nV-Dox; isomer 2 


D 


367 


Ac-S-G-R-S-G(hex)-Dox 


A 


368 


Ac-S-G-R-S-Cha-Dox 


B 


369 


Ac-S-G-R-S-hCha-Dox 


A 


370 


Ac-S-A-R-S-L-Dox 


D 


371 


Ac-S-A-R-S-S-L-Dox 


D 


372 


Ac-S-S-R-S-nL-Dox 


C 


373 


Ac-T-G-R-S-Abu-Dox 


A 


374 


Ac-T-G-R-S-L-Dox 


B 


375 


Ac-T-G-R-S-nV-Dox 


A 


376 


Ac-T-G-R-S-nL-Dox 


A 


377 


Ac-T-G-R-S-G(hex)-Dox 


A 


378 


Ac-T-G-R-S-Cha-Dox 


B 


379 


Ac-T-G-R-S-hCha-Dox 


A 


380 


Ac-T-G-R-T-Abu-Dox 


B 


381 


Ac-T-G-R-hS-nL-Dox 


B 


362 


Ac-T-G-R-Abu-nL-Dox 


A 


383 


Ac-T-G-R-Abu-nV-Dox 


B 


384 


Ac-T-G-F(Gn)-S-nL-Dox 


A 


385 



FIG. 2A 
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CONJUGATE 


uPA CT50 


SEQ ID NO 


Ac-T-G-F(Gn)-S-Cha-Dox 


A 


386 


Ac-T-G-F(Gn)-Abu-nV-Dox 


NT 


387 


Ac-T-G-K(alIoc)-S-nL-Dox 


D 


388 


Ac-T-G-K-S-nL-Dox 


B 


389 


Ac-T-G-hR-S-nL-Dox 


D 


390 


Ac-(hS)G-G-R-S-nL-Dox 


D 


391 


MeOCO-T-G-R-S-nL-Dox 


A 


392 


PhS02-T-G-R-S-nL-Dox 


B 


393 


MeOEtCO-T-G-R-S-nL-Dox 


A 


394 


MeO(EtO)2Ac-T-G-R-S-nL-Dox 


A 


395 


4-oxo-PentanoyI-T-G-R-S-nL-Dox 


A 


396 


3,4-MethyldioxyPhAc-T-G-R-S-nL-Dox 


A 


397 


2-PyridylAc-T-G-R-S-nL-Dox 


A 


398 


PhOAoT-G-R-S-nL-Dox 


A 


399 


L-3-PhLactyl-T-G-R-S-nL-Dox 


A 


400 


MeOAc-T-G-R-S-nL-Dox 


A 


401 


PhAo-T-G-R-S-nL-Dox 


A 


402 


MeOEtOCO-T-G-R-S-nL-Dox 


A 


403 


MeOEtOAc-T-G-R-S-nL-Dox 


A 


404 


HOOCButa-T-G-R-S-nL-Dox 


A 


405 


Z-T-G-R-S-nL-Dox 


A 


406 


EtOCO-T-G-R-S-nL-Dox 


A 


407 


pA-T-G-R-S-nL-Dox 


A 


408 


Pent-4-ynoyl-T-G-R-S-nL-Dox 


A 


409 



FIG. 2B 



wo 02/095007 



8/16 



PCT/US02/16819 



CONJUGATE 


uPA CToO 


SEQ ID NO 


NapAc-T-G-R-S-nL-Dox 


B 


410 


iBoc-T-G-R-S-nL-Dox 


A 


411 


HOAc-T-G-R-S-nL-Dox 


^ A 


412 


MeSucc-T-G-R-S-nL-Dox 


A 


413 


N,N-diMeGly-T-G-R-S-nL-Dox 


A 


414 


Succ-T-G-R-S-nL-Dox 


B 


415 


HCO-T-G-R-S-nL-Dox 


A 


416 


Ac-T-A-R-S-nL-Dox 


A 


417 


Ac-T-A-F(Gn)-S-nL-Dox 


A 


418 


Ac-T-A-R-Abu-nV-Dox 


NT 


419 


Ac-T-A-R-S-Abu-Dox 


B 


420 


Ac-T-A-R-T-Abu-Dox 


B 


421 


Ac-T-S(0-Me)-R-S-nL-Dox 


B 


422 


Ac-T-hS-R-S-nL-Dox 


6 


423 


Ac-T-(1 -Me)H-R-S-nL-Dox 


NT 


424 


Ac-T-(3-Me)H-R-S-nL-Dox 


NT 


425 


Ac-T-H-R-S-nL-Dox 


C 


426 


Ac-T-Sar-R-S-nL-Dox 


D 


427 


Ac-T-nV-R-S-nL-Dox 


D 


428 


Ac-T-nL-R-S-nL-Dox 


B 


429 


Ac-T-A-R-S-Cha-Dox 


B 


430 


Ac-T-Abu-R-S-nL-Dox 


B 


431 


Ac-4,4diMeThr-G-R-S-nL-Dox 


B 


432 


Ac-hS-G-R-S-nL-Dox 


D 


433 



FIG. 2C 
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CONJUGATE 


uPA CT50 


SEQ ID NO 


Ac-hS-G-R-hS-Cha-Dox 


D 


434 


Ac-hS-G-R-S-Cha-Dox 


D 


435 


Ac-hS-G-R-T-Cha-Dox 


D 


436 


Ac-hS-A-R-S-Cha-Dox 


D 


437 


Ac-N-G-R-S-nL-Dox 


D 


438 


AoY-G-R-S-S-L-Dox 


D 


439 


Ac-Y-G-R-S-Cha-Dox 


D 


440 


AoQ-G-R-S-S-nL-Dox 


D 


441 


Ac-Q-G-R-S-S-nV-Dox 


D 


442 


Ac-L-R-G-S-G-R-S-A-Dox 


B 


573 


Ac-L-R-G-S-G-R-S-L-Dox 


C 


342 


Ac-L-R-G-S-G-R-S-dL-Dox 


D 


343 


Ac-L-R-G-S-G-R-S-S-nL-Dox 


D 


344 


Ac-L-R-G-S-G-R-S-S-Cha-Dox 


C 


345 


Ac-L-R-G-dS-A-R-S-A-Dox 


C 


574 


Ac-L-R-G-S-A-R-S-S-L-Dox 


D 


346 


Ac-L-R-G-S-A-R-S-L-Dox 


C 


347 


Ac-L-R-G-S-A-R-S-S-Cha-Dox 


C 


348 


Ac-L-R-G-S-A-R-S-S-nV-Dox 


D 


349 


Ac-L-R-G-S-A-R-S-S-nL-Dox 


D 


350 


Ac-V-I-V-S-G-R-A-L-Dox 


D 


351 


Ac-V-I-V-S-A-R-S-L-Dox 


D 


352 


Ac-V-I-V-S-G-R-S-S-L-Dox 


C 


353 


Ac-V-I-V-S-A-R-M-A-Dox 


C 


354 



FIG. 2D 
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CONJUGATE 


uPA CT50 


SEQ ID NO 


Ac-V-l-V-S-A-R-nL-A-Dox 


D 


355 


Ac-V-l-V-S-A-R-S-nL-Dox 


D 


356 


Ac-V-I-V-S-A-R-S-Cha-Dox 


D 


357 


Ac-V-l-V-S-A-R-S-dCha-Dox 


D 


358 


Ac-V-I-V-S-A-R-S-S-Cha-Dox 


D 


359 


Ac-R-R-(Me)C-P-G-R-V-V-Dox 


D 


360 


Ac-R-R-nV-P-A-R-S-L-Dox 


D 


361 


Ac-R-G-dS-A-R-S-A-Dox 


C 


309 


Ac-R-G-S-G-R-S-A-Dox 


A 


310 


Ac-R-G-S-G-R-A-L-Dox 


D 


311 


Ac-R-G-S-G-R-S-L-Dox 


B 


312 


Ac-R-G-S-G-R-S-nL-Dox 


A 


313 


Ac-R-G-S-G-R-A-nL-Dox 


B 


314 


Ac-R-G-S-G-R-S-S-L-Dox 


C 


315 


Ac-R-G-S-G-R-S-Cha-Dox 


C 


316 


AoR-G-S-G-R-S-S-Cha-Dox 


C 


317 


Ac-R-G-S-A-R-S-Cha-Dox 


B 


318 


Ac-R-G-S-A-R-S-S-Cha-Dox 


B 


319 


Ac-R-G-S-A-R-S-nV-Dox 


B 


320 


Ac-R-G-S-A-R-S-S-nV -Dox 


C 


321 


Ac-R-G-S-A-R-S-L-Dox 


D 


322 


Ac-R-(Me)C-P-G-R-V-V-Dox 


D 


323 


Ac-R-(Me)C-P-G-R-V-V-Dox 


D 


324 


Ac-R-C(Me)-P-G-R-S-L-Dox 


D 


325 



FIG. 2E 
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CONJUGATE 


uPA CT50 


SEQ ID NO 


Ac-R-L-P-G-R-S-L-Dox 


D 


326 


Ac-R-V-P-G-R-S-L-Dox 


D 


327 


Ac-R-V-P-G-R-S-dL-Dox 


D 


328 


Ac-R-nL-P-G-R-S-L-Dox 


D 


329 


Ac-R-G(tBu)-P-A-R-S-L-Dox 


D 


330 


Ac-R-L-P-A-R-S-L-Dox 


D 


331 


Ac-R-V-P-A-R-S-L-Dox 


D 


332 


Ac-R-nL-P-A-R-S-L-Dox 


D 


333 


Ac-l-V-S-GrR-A-L-Dox 


D 


334 


Ac-I-V-S-G-R-S-S-L-Dox 


D 


335 


Ac-I-V-S-G-R-A-S-L-Dox 


D 


336 


Ac-I-V-S-A-R-M-A-Dox 


B 


337 


Aol-V-S-A-R-nL-A-Dox 


B 


338 


Ac-I-V-S-A-R-S-L-Dox 


C 


339 


Ac-l-V-S-A-R-S-nL-Dox 


B 


340 


Ac-l-V-S-A-R-S-S-L-Dox 


C 


341 


Ac-G-S-G-R-S-A-Dox 


B 


685 


Ac-G-S-G-R-S-L-Dox 


c 


277 


Ac-G-S-G-R-A-L-Dox 


D 


278 


Ac-G-S-G-R-S-S-L-Dox 


D 


279 


Ac-G-S-G-R-L-Dox 


D 


280 


Ac-G-S-G-(4-guan)Phg-S-L-NH2 


D 


281 


Ao-G-S-G-R-S-S-Cha-Dox 


D 


282 


Ac-G-S-G-R-A-S-L-Dox 


D 


283 



FIG. 2F 
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CONJUGATE 


uPA CT50 


SEQ ID NO 


Ac-G-S-G-R-S-nL-Dox 


A 


284 


Ac-G-T-G-R-S-nL-Dox 


A 


285 


Succ-pA-T-G-R-S-n L-Dox 


A 


286 


Ac-G-T-G-R-S-hCha-Dox 


A 


287 


Ac-G-hS-G-R-S-nL-Dox 


D 


288 


Ac-G-dS-A-R-S-A-Dox 


C 


289 


Ac-G-S-A-R-S-L-Dox 


D 


290 


Ac-G-S-A-R-S-S-Cha-Dox 


C 


291 


Ac-G-S-A-R-S-S-L-Dox 


D 


292 


Ac-G-S-A-R-A-S-L-Dox 


D 


293 


Ac-V-S-G-R-S-L-Dox 


D 


294 


Ac-V-S-G-R-A-L-Dox 


D 


295 


Ac-V-S-G-R-A-S-L-Dox 


D 


296 


AoV-S-G-R-S-S-L-Dox 


D 


297 


Ac-V-S-A-R-M-A-Dox 


B 


298 


Ac-V-S-A-R-nL-A-Dox 


B 


299 


Ac-V-S-A-R-S-nL-Dox 


B 


300 


Ao-V-S-A-R-S-L-Dox 


D 


301 


Ac-(Me)C-P-G-R-V-dV-Dox 


D 


302 


Ac-(Me)C-P-G-R-V-V-Dox 


D 


303 


Ac-C(Me)-P-G-R-A-L-Dox 


D 


304 


Ac-C(Me)-P-G-R-S-L-Dox 


D 


305 


Ac-C(Me)-P-A-R-S-L-Dox 


D 


306 


Ac-C(Me)-P-A-R-A-S-L-Dox 


D 


307 



FIG. 2G 
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Ac-G(tBu)-P-G-R-S-L-Dox D 308 



FIG. 2H 
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CONJUGATE 


MTSPI CT50 


SEQ ID NO 


Ac-Q-S-R-A-A-Tax 


B 


552 


Ac-Q-S-R-S-A-Tax 


B 


553 


Ac-Q-S-R-S-G-Tax 


c 


554 


Ac-R-S-R-A-A-Tax 


B 


555 


Ac-R-Q-S-R-A-A-Tax 


A 


556 


Ac-R-Q-S-R-S-A-Tax 


A 


557 


Ac-R-Q-S-R-S-A-A-Tax 


A 


558 


Ac-R-Q-S-R-A-A-Tax 


A 


559 



FIG. 3 
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CONJUGATE 


uPA CT50 


SEQ ID NO 


Ac-R-G-S-G-R-S-A-Tax 


D 


559 


AoS-G-R-A-A-Tax 


D 


560 


Ac-S-G-R-S-A-Tax 


D 


561 


Ac-S-G-R-S-S-A-Tax 


D 


562 


Ac-S-G-R-A-S-A-Tax 


D 


563 


Ac-S-G-R-S-G-Tax 


D 


564 


Ac-S-G-R-S-S-G-Tax 


D 


565 


Ac-S-G-R-S-G-A-Tax 


D 


566 


Ac-S-G-R-S-G-G-Tax 


D 


567 


Ac-G-T-G-R-S-G-G-Tax 


C 


568 


Ac-G-S-G-R-S-G-G-Tax 


C 


518 



FIG. 4 
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CONJUGATE 


ET1 CT50 


SEQ ID NO 


Ac-L-R-R-Q-S-R-A-A-Dox 


B 


597 


MeS02-dA(Chx)-Abu-R-S-L-Dox 


D 


598 


Ac-R-A-R-S-L-Dox 


B 


599 


Ac-dA(Chx)-Abu-R-S-L-Dox 


C 


600 


Ac-dA(Chx)-Abu-R-S-S-L-Dox 


B 


601 


Ac-Q-G-R-S-S-L-Dox 


A 


602 


MeOCO-dhF-PfOHVR-S-S-L-Dox 

tvi^^^^^^^^ villi 1 l^^l ■/ 1 ^ i^VXV 


B 


603 


MeOCO-Quat4-G-R-S-L-Dox 


D 


604 


Ac-dCha-P(OH)-R-S-S-L-Dox 


B 


605 


Ac-dCha-Abu-R-S-S-A-Tax 


B 


606 


MeOCO-Quat2-G-R-S-L-NH2 


B 


607 


MeOCO-Quat3-G-R-S-L-NH2 


B 


608 


MeOCO-Quat-G-R-S-L-NH2 


C 


609 
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SEQUENCE LISTING 

<110> Edwin Ii. Madison 

Joseph Edward Semple 
George P. Vlasuk 

Scott Jeffrey Kemp 
Mallareddy Komandla 
Dani e 1 Vaima S i e v 

<120> Conjugates Activated By Cell Surface Proteases and Therapeutic Uses Thereof 

<130> 24745-1611PC 

<140> Not Yet Assigned 
<141> Herewith 

<160> 611 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 3147 

<212> DNA 

<213> Homo Sapien 

<220> 

<223> Nucleotide sequence encoding MTSPl 
<221> CDS 

<222> (23) . . . (2589) 
<300> 

c301> O'Brien, T.J. and Tanimoto, H. 
<308> GenBank AR081724 
<310> US Pat 5972616 
<311> 1998-02-20 
<312> 1999-10-26 

<400> 1 

tcaagagcgg cctcggggta cc atg ggg age gat egg gcc cgc aag ggc gga 52 

Met Gly Ser Asp Arg Ala Arg Lys Gly Gly 
15 10 

gg9 ccg aag gac ttc ggc gcg gga etc aag tac aac tec egg cac 

Gly Gly Pro Lys Asp Phe Gly Ala Gly Leu Lys Tyr Asn Ser Arg His 

15 20 25 

gag aaa gtg aat ggc ttg gag gaa ggc gtg gag ttc ctg cea gtc aac 
Glu Lys Val Asn Gly Leu Glu Glu Gly Val Glu Phe Leu Pro Val Asn 
30 35 40 

aac gtc aag aag gtg gaa aag cat ggc ccg ggg cgc tgg gtg gtg ctg 
Asn Val Lys Lys Val Glu Lys His Gly Pro Gly Arg Txp Val Val Leu 
45 50 5S 

gca gcc gtg ctg ate ggc etc etc ttg gtc ttg ctg ggg ate ggc ttc 
Ala Ala Val Leu lie Gly Leu Leu Leu Val Leu Leu Gly lie Gly Phe 
60 65 70 

ctg gtg tgg cat ttg cag tac egg gac gtg cgt gtc cag aag gtc ttc 
Leu Val Trp His Leu Gin Tyr Arg Asp Val Arg Val Gin Lys Val Phe 
75 80 85 90 



100 
148 
196 
244 
292 
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aat ggc tac atg agg ate aca aat gag aat ttt gtg gat gcc tac gag 
Asn Gly Tyr Met Arg lie Thr Asn Glu Asn Phe Val Asp Ala Tyr Glu 

95 100 105 



aag gag teg get gtg acg gcc ttc age gag ggc age gtc ate gcc tac 
Lys Glu Ser Ala Val Thr Ala Phe Ser Glu Gly Ser Val He Ala Tyr 
140 145 150 



340 



aae tec aac tec act gag ttt gta age ctg gcc age aag gtg aag gac 388 
Asn Ser Asn Ser Thr Glu Phe Val Ser Leu Ala Ser Lys Val Lys Asp 
110 115 120 

geg ctg aag etg ctg tac age gga gtc eca ttc ctg ggc - ccc tac cac 436 
Ala Leu Lys Leu Leu Tyr Ser Gly Val Pro Phe Leu Gly Pro Tyr His 
125 130 135 



484 



tac tgg tct gag ttc age ate ceg cag eae ctg gtg gag gag gcc gag 532 
Tyr Trp Ser Glu Phe Ser He Pro Gin His Leu Val Glu Glu Ala Glu 
155 160 165 170 

cge gtc atg gcc gag gag cgc gta gtc atg ctg ccc ceg egg geg cgc 580 
Arg Val Met Ala Glu Glu Arg Val Val Met Leu Pro Pro Arg Ala Arg 

175 180 185 

tec etg aag tec ttt gtg gtc ace tea gtg gtg get ttc ccc acg gac 
Ser Leu Lys Ser Phe Val Val Thr Ser Val Val Ala Phe Pro Thr Asp 
190 195 200 

tec aaa aca gta cag agg acc cag gac aac age tgc age ttt ggc ctg 

Ser Lys Thr Val Gin Arg Thr Gin Asp Asn Ser Cys Ser Phe Gly Leu 
205 210 215 

r 

cac gee cgc ggt gtg gag ctg atg cgc ttc acc acg ccc ggc ttc cct 

His Ala Arg Gly Val Glu Leu Met Arg Phe Thr Thr Pro Gly Phe Pro 

220 225 230 

gac age ccc tac ccc get cat gcc cgc tgc cag tgg gcc ctg egg ggg 772 
Asp Ser Pro Tyr Pro Ala His Ala Arg Cys Gin Trp Ala Leu Arg Gly 
235 240 245 250 



628 



676 



724 



868 



916 



gac gcc gac tea gtg ctg age etc acc ttc cgc age ttt gae ett geg 820 

Asp Ala Asp Ser Val Leu Ser Leu Thr Phe Arg Ser Phe Asp Leu Ala 

255 260 265 

tec tgc gac gag cgc ggc age gac ctg gtg acg gtg tac aac acc ctg 

Ser Cys Asp Glu Arg Gly Ser Asp Leu Val Thr Val Tyr Asn Thr Leu 

270 275 280 

age ccc atg gag ccc cac gcc ctg gtg cag ttg tgt ggc acc tac cct 

Ser Pro Met Glu Pro His Ala Leu Val Gin Leu Cys Gly Thr Tyr Pro 
285 290 295 

eee tee tac aae ctg acc ttc cac tec tec cag aac gtc etg etc ate 964 

Pro Ser Tyr Asn Leu Thr Phe His Ser Ser Gin Asn Val Leu Leu He 
300 305 310 

aca etg ata acc aac act gag egg egg eat ccc ggc ttt gag gcc ace 1012 

Thr Leu He Thr Asn Thr Glu Arg Arg His Pro Gly Phe Glu Ala Thr 

315 320 325 330 



ttc ttc cag ctg cct agg atg age age tgt gga ggc cgc tta cgt aaa 



1060 
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Phe Phe Gin Leu Pro Arg Met Ser Ser Cys Gly Gly Arg Leu Arg Lys 

335 340 345 

gcc cag ggg aca ttc aac age ccc tac tac cca ggc cac tac cca ccc 
Ala Gin Gly Thr Phe Asn Ser Pro Tyr Tyr Pro Gly His Tyr Pro Pro 
350 - 355 360 



gcc gac tgc acc gac cac age gat gag etc aac tgc agt tgc gac gcc 

Ala Asp Cys Thr Asp Hie Ser Asp Glu Leu Asn Cys Ser Cys Asp Ala 

475 480 485 490 

ggc cac cag ttc acg tgc aag aac aag ttc tgc aag ccc etc ttc tgg 

Gly His Gin Phe Thr Cys Lys Asn Lys Phe Cys Lys Pro Leu Phe Trp 

495 500 505 



teg aaa age cag cag tgc aat ggg aag gac gac tgt ggg gac ggg tec 
Ser Lys Ser Gin Gin Cys Asn Gly Lys Asp Asp Cys Gly Asp Gly Ser 
540 545 550 

gac gag gcc tec tgc ccc aag gtg aac gtc gtc act tgt acc aaa cac 
Asn Glu Ala Ser Cys Pro Lys Val Asn Val Val Thr Cys Thr Lys His 
555 560 565 570 

acc tac cgc tgc etc aat ggg etc tgc ttg age aag ggc aac cet gag 
Thr Tyr i\rg Cys Leu Asn Gly Leu Cys Leu Ser Lys Gly Asn Pro Glu 



1108 



aac att gac tgc aca tgg aac att gag gtg ccc aac aac cag cat gtg 1156 
Asn lie Asp Cys Thr Trp Asn lie Glu Val Pro Asn Asn Gin His Val 
365 370 375 

aag gtg age ttc aaa ttc ttc tac ctg ctg gag ccc ggc gtg cet gcg 
Lys Val Ser Phe Lys Phe Phe Tyr Leu Leu Glu Pro Gly Val Pro Ala 
380 385 390 



1204 



ggc acc tgc ccc aag gac tac gtg gag ate aat ggg gag aaa tac tgc 1252 
Gly Thr Cys Pro Lys Asp Tyr Val Glu He Asn Gly Glu Lys Tyr Cys 
395 400 405 410 

gga gag agg tec cag ttc gtc gtc ace age aac age aac aag ate aca 
Gly Glu Arg Ser Gin Phe Val Val Thr Ser Asn Ser Asn Lys He Thr 

415 420 425 

gtt cgc ttc cac tea gat cag tec tac acc gac acc ggc ttc tta get 
Val Arg Phe His Ser Asp Gin Ser Tyr Thr Asp Thr Gly Phe Leu Ala 
430 435 440 

gaa tac etc tec tac gac tec agt gac cca tgc ecg ggg cag ttc acg 
Glu Tyr Leu Ser Tyr Asp Ser Ser Asp Pro Cys Pro Gly Gin Phe Thr 
445 450 455 

tgc cgc acg ggg egg tgt ate egg aag gag ctg cgc tgt gat ggc tgg 1444 
Cys Arg Thr Gly Arg Cys He Arg Lys Glu Leu Arg Cys Asp Gly Trp 
460 465 470 



1300 



1348 



1396 



1492 



1540 



gtc tgc gac agt gtg aac gac tgc gga gac aac age gac gag cag ggg 1588 
Val Cys Asp Ser Val Asn Asp Cys Gly Asp Asn Ser Asp Glu Gin Gly 
510 515 520 



I 



tgc agt tgt ccg gcc cag acc ttc agg tgt tec aat ggg aag tgc etc 1636 
Cvs Ser Cys Pro Ala Gin Thr Phe Arg Cys Ser Asn Gly Lys Cys Leu 
525 530 535 



1684 



1732 



1780 
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575 580 585 

tgt gac ggg aag gag gac tgt age gac ggc tea gat gag aag gac tgc 1828 
Cys Asp Gly Lys Glu Asp Cys Ser Asp Gly Ser Asp Glu Lys Asp Cys 
590 595 600 

gac tgt ggg ctg egg tea ttc acg aga cag get cgt gtt gtt ggg ggc 1876 
Asp Cys Gly Leu Arg Ser Phe Thr Arg Gin Ala Arg Val Val Gly Gly 
605 610 615 

acg gat gcg gat gag gge gag tgg ccc tgg cag gta age ctg cat get 1924 
Thr Asp Ala Asp Glu Gly Glu Trp Pro Trp Gin Val Ser Leu His Ala 
620 625 630 

Ctg ggc cag ggc cac ate tgc ggt get tee etc ate tct ccc aae tgg 1972 
Leu Gly Gin Gly His lie Cys Gly Ala Ser Leu lie Ser Pro Asn Trp 
635 640 645 650 

ctg gtc tct gcc gca cae tgc tae ate gat gac aga gga ttc agg tac 202 0 

Leu Val Ser Ala Ala His Cys Tyr He Asp Asp Arg Gly Phe Arg Tyr 

655 660 665 

tea gae ccc acg cag tgg acg gcc ttc ctg ggc ttg cac gac eag age 2068 
Ser Asp Pro Thr Gin Trp Thr Ala Phe Leu Gly Leu His Asp Gin Ser 
670 675 680 

cag cge age gee cet ggg gtg cag gag cgc agg etc aag cgc ate ate 2116 
Gin Arg Ser Ala Pro Gly Val Gin Glu Arg Arg Leu Lys Arg He He 
685 690 695 

tec cae eee ttc ttc aat gac ttc acc ttc gac tat gac ate gcg ctg 2164 
Ser His Pro Phe Phe Asn Asp Phe Thr Phe Asp Tyr Asp He Ala Leu 
700 705 710 

ctg gag ctg gag aaa ccg gca gag tac age tec atg gtg egg ccc ate 2212 
Leu Glu Leu Glu Lys Pro Ala Glu Tyr Ser Ser Met Val Arg Pro He 
715 720 725 730 

tgc ctg ccg gac gcc tec cat gtc ttc ect gee ggc aag gcc ate tgg 2260 
Cys Leu Pro Asp Ala Ser His Val Phe Pro Ala Gly Lys Ala He Trp 

735 740 745 

gtc acg ggc tgg gga cae acc cag tat gga gge act ggc gcg ctg ate 2308 
Val Thr Gly Trp Gly His Thr Gin Tyr Gly Gly Thr Gly Ala Leu He 
750 755 760 

ctg caa aag ggt gag ate cge gtc ate aac cag acc acc tgc gag aac 2356 
Leu Gin Lys Gly Glu He Arg Val He Asn Gin Thr Thr Cys Glu Asn 
765 770 775 

etc ctg ccg cag cag ate acg ccg cgc atg atg tgc gtg ggc ttc etc 2404 
Leu Leu Pro Gin Gin He Thr Pro Arg Met Met Cys Val Gly Phe Leu 
780 785 790 

age ggc ggc gtg gac tec tgc cag ggt gat tec ggg gga ccc ctg tec 2452 
Ser Gly Gly Val Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Ser 
795 800 805 810 

age gtg gag gcg gat ggg egg ate ttc cag gcc ggt gtg gtg age tgg 2500 
Ser Val Glu Ala Asp Gly Arg He Phe Gin Ala Gly Val Val Ser Trp 

815 820 825 
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gga gac ggc tgc get cag agg aac aag cca ggc gtg tac aca agg etc 2548 
Gly Asp Gly Cys Ala Gin Arg Asn Lys Pro Gly Val Tyr Thr Arg lieu 
830 835 840 

cct ctg ttt egg gac tgg ate aaa gag aac aet ggg gta ta ggggcegggg 2599 
Pro Leu Phe Arg Asp Trp lie Lys Glu Aen Thr Gly Val 
845 B50 855 

ceaeccaaat gtgtacacct gcggggccac ccatcgtcea ccccagtgtg cacgcctgca 2 659 

ggctggagac tggacegctg actgcaccag egececcaga acataeaetg tgaaeteaat 2719 

ctccagggct ceaaatctgc ctagaaaacc tctcgcttee teagectcca aagtggagct 2779 

gggaggtaga aggggaggae actggtggtt ctactgaecc aaetggggge aaaggtttga 2839 

agacacagce tcccccgcca gccccaagct gggccgaggc gcgtttgtgt atatctgcet 2899 

cccctgtetg taaggagcag cgggaaegga getteggage eteetcagtg aaggtggtgg 2959 

ggctgecgga tctgggctgt ggggcccttg ggccacgctc ttgaggaage ceaggetcgg 3019 

aggaecctgg aaaacagacg ggtctgagac tgaaattgtt t tac cage tc ccagggtgga 3079 

cttcagtgtg tgtatttgtg taaatgggta aaacaattta tttcttttta aaaaaaaaaa 3139 

aaaaaaaa 3147 

<210> 2 

<211> 855 

<212> PRT 

<213> Homo Sapien 



<400> 2 



Met 


Gly 


Ser 


Asp 


Arg 


Ala 


Arg 


Lys 


Gly 


Gly 


Gly 


Gly 


Pro 


Lys 


Asp 


Phe 


1 








5 










10 










15 




Gly Ala 


Gly 


Leu 


Lye 


Tyr 


Asn 


Ser 


Arg 


His 


Glu 


Lys 


Val 


Asn 


Gly 


Leu 








20 










25 










30 






Glu 


Glu 


Gly 
35 


Val 


Glu 


Phe 


Leu 


Pro 
40 


Val 


Asn 


Asn 


Val 


Lys 
45 


Lys 


Val 


Glu 


Lys 


His 
50 


Gly 


Pro 


Gly 


Arg 


Trp 
55 


Val 


Val 


Leu 


Ala 


Ala 

60 


Val 


Leu 


He 


Gly 


Leu 


Leu 


Leu 


Val 


Leu 


Leu 


Gly 


lie 


Gly 


Phe 


Leu 


Val 


Trp 


His 


Leu 


Gin 


65 










70 










75 










80 


Tyr Arg 


Asp 


Val 


Arg 


Val 


Gin 


Lys 


Val 


Phe 


Asn 


Gly 


Tyr 


Met 


Arg 


He 










85 










90 










95 




Thr 


Asn 


Glu 


Asn 
100 


Phe 


Val 


Asp 


Ala 


Tyr 
105 


Glu 


Asn 


Ser 


Asn 


Ser 
110 


Thr 


Glu 


Phe 


Val 


Ser 
115 


Leu 


Ala 


Ser 


Lys 


Val 
120 


Lys 


Asp 


Ala 


Leu 


Lys 
125 


Leu 


Leu 


Tyr 


Ser Gly 


Val 


Pro 


Phe 


Leu 


Gly 


Pro 


Tyr 


His 


Lys 


Glu 


Ser 


Ala 


Val 


Thr 




130 










135 










140 










Ala 


Phe 


Ser 


Glu 


Gly 


Ser 


Val 


He 


Ala 


Tyr 


Tyr 


Trp 


Ser 


Glu 


Phe 


Ser 


145 










150 










155 










160 


lie 


Pro 


Gin 


His 


Leu 
165 


Val 


Glu 


Glu 


Ala 


Glu 
170 


Arg 


Val 


Met 


Ala 


Glu 
175 


Glu 


Arg 


Val 


Val 


Met 


X^u 


Pro 


Pro 


Arg 


Ala 


Arg 


Ser 


Leu 


Lye 


Ser 


Phe 


Val 






180 










185 










190 






Val 


Thr 


Ser 
195 


Val 


Val 


Ala 


Phe 


Pro 
200 


Thr 


Asp 


Ser 


Lys 


Thr 
205 


Val 


Gin 


Arg 


Thr 


Gin 

210 


Asp 


Asn 


Ser 


Cys 


Ser 
215 


Phe 


Gly 


Leu 


His 


Ala 
220 


Arg 


Gly 


Val 


Glu 


Leu 


Met 


Arg 


Phe 


Thr 


Thr 


Pro 


Gly 


Phe 


Pro 


Asp 


Ser 


Pro 


Tyr 


Pro 


Ala 


225 










230 










235 










240 


His 


Ala 


Arg 


Cys 


Gin 
245 


Trp 


Ala 


Leu 


Arg 


Gly 
250 


Asp 


Ala 


Asp 


Ser 


Val 
255 


Leu 


Ser 


Leu 


Thr 


Phe 
260 


Arg 


Ser 


Phe 


Asp 


Leu 
265 


Ala 


Ser 


Cye 


Asp 


Glu 
270 


Arg 


Gly 


Ser 


Asp 


Leu 


Val 


Thr 


Val 


Tyr 


Asn 


Thr 


Leu 


Ser 


Pro 


Met 


Glu 


Pro 


His 
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275 

Ala Leu Val Gin 
290 

Phe His Ser Ser 
305 

Glu Arg Arg His 

Met Ser Ser Cys 
340 

Ser Pro Tyr Tyx: 
355 

Asn lie Glu Val 
370 

Phe Tyr Leu Leu 
385 

Tyr Val Glu lie 

Val Val Thr Ser 
420 

Gin Ser Tyr Thr 
435 

Ser Ser Asp Pro 
450 

lie Arg Lys Glu 
465 

Ser Asp Glu Leu 

Lys Asn Lys Phe 
500 

Asp Cys Gly Asp 
515 

Thr Phe Arg Cys 
530 

Asn Gly Lys Asp 
545 

Lys Val Asn Val 

Gly Leu Cys Leu 
580 

Cys Ser Asp Gly 
595 

Phe Thr Arg Gin 
610 

Glu Trp Pro Trp 
625 

Cys Gly Ala Ser 

Cys Tyr lie Asp 
660 

Thr Ala Phe Leu 
675 

Val Gin Glu Arg 
690 

Asp Phe Thr Phe 
705 

Ala Glu Tyr Ser 

His Val Phe Pro 
740 

Thr Gin Tyr Gly 
755 



280 

Leu Cys Gly Thr 
2 95 

Gin Asn Val lieu 
310 

Pro Gly Phe Glu 
325 

Gly Gly Arg Leu 

Pro Gly His Tyr 
360 

Pro Asn Asn Gin 
375 

Glu Pro Gly Val 
390 

Asn Gly Glu Lys 
405 

Asn Ser Asn Lys 

Asp Thr Gly Phe 
440 

Cys Pro Gly Gin 
455 

Leu Arg Cys Asp 
470 

Asn Cys Ser Cys 
485 

Cys Lys Pro Leu 

Asn Ser Asp Glu 
520 

Ser Asn Gly Lys 
535 

Asp Cys Gly Asp 
550 

Val Thr Cys Thr 
565 

Ser Lys Gly Asn 

Ser Asp Glu Lys 
600 

Ala Arg Val Val 
615 

Gin Val Ser Leu 
630 

Leu He Ser Pro 
645 

Asp Arg Gly Phe 

Gly Leu His Asp 
680 

Arg Leu Lys Arg 
695 

Asp Tyr Asp He 
710 

Ser Met Val Arg 
725 

Ala Gly Lys Ala 

Gly Thr Gly Ala 
760 



Tyr Pro Pro Ser 
300 

Leu He Thr Leu 
315 

Ala Thr Phe Phe 
330 

Arg Lys Ala Gin 
345 

Pro Pro Asn He 

His Val Lys Val 
380 

Pro Ala Gly Thr 
395 

Tyr Cys Gly Glu 
410 

He Thr Val Arg 
425 

Leu Ala Glu Tyr 

Phe Thr Cye Arg 
460 

Gly Trp Ala Asp 
475 

Asp Ala Gly His 
490 

Phe Trp Val Cys 
505 

Gin Gly Cys Ser 

Cys Leu Ser Lys 
540 

Gly Ser Asp Glu 
555 

Lys His Thr Tyr 
570 

Pro Glu Cys Asp 
585 

Asp Cys Asp Cys 

Gly Gly Thr Asp 
620 

His Ala Leu Gly 
635 

Asn Trp Leu Val 
650 

Arg Tyr Ser Asp 
665 

Gin Ser Gin Arg 

He He Ser His 
700 

Ala Leu Leu Glu 
715 

Pro He Cys Leu 
730 

He Trp Val Thr 
745 

Leu He Leu Gin 



285 

Tyr Asn Leu Thr 

He Thr Asn Thr 
320 

Gin Leu Pro Arg 
335 

Gly Thr Phe Asn 
350 

Asp Cys Thr Trp 
365 

Ser Phe Lys Phe 

Cys Pro Lys Asp 
400 

Arg Ser Gin Phe 
415 

Phe His Ser Asp 
430 

Leu Ser Tyr Asp 
445 

Thr Gly Arg Cys 

Cys Thr Asp His 
480 

Gin Phe Thr Cys 
495 

Asp Ser Val Asn 
510 

Cys Pro Ala Gin 
525 

Ser Gin Gin Cys 

Ala Ser Cys Pro 
560 

Arg Cys Leu Asn 
575 

Gly Lys Glu Asp 
590 

Gly Leu Arg Ser 
605 

Ala Asp Glu Gly 

Gin Gly His He 
640 

Ser Ala Ala His 
655 

Pro Thr Gin Trp 
670 

Ser Ala Pro Gly 
685 

Pro Phe Phe Asn 

Leu Glu Lys Pro 
720 

Pro Asp Ala Ser 
735 

Gly Trp Gly His 
750 

Lys Gly Glu He 
765 
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Arg 


Val 


He Asn Gin Thr Thr 


Cys 


Glu 


Asn 


Leu Leu 


Pro Gin Gin 


He 


770 


775 






780 






Thr 


Pro 


Arg Met Met Cys Val 


Gly 


Phe 


Leu 


Ser Gly Gly Val Asp 


Ser 


785 




790 








795 




800 


Cys 


Gin 


Gly Asp Ser Gly Gly 


Pro 


Leu 


Ser 


Ser Val 


Glu Ala Asp 


Gly 






805 






810 




815 


i 


Arg 


He 


Phe Gin Ala Gly Val Val 


Ser 


Trp 


Gly Asp 


Gly Cys Ala 


Gin 




820 




825 






830 




Arg Aszi 


Lys Pro Gly Val Tyr 


Thr 


Arg 


Leu 


Pro Leu 


Phe- Arg Asp 


Trp 






835 


840 








845 




He 


Lys 


Glu Asn Thr Gly Val 
















850 


855 















<210> 3 

<211> 2137 

<212> DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> {261) . . . (1574) 

<223> Nucleic acid encoding a transmembreuie serine 
protease (MTSP3) protein 

<400> 3 

ccatcctaat acgactcact atagggctcg agcggccgcc cgggcaggtc agagagaggc 60 

agcagcttgc tcagcggaca aggatgctgg gcgtgaggga ccaaggcctg ccctgcactc 12 0 

gggcctcctc cagccagtgc tgaccaggga cttctgacct gctggccagc caggacctgt 180 

gtggggaggc cctcctgctg ccttggggtg acaatctcag ctccaggcta cagggagacc 240 

gggaggatca cagagccagc atg tta cag gat cct gac agt gat caa cct ctg 293 

Met Leu Gin Asp Pro Asp Ser Asp Gin Pro Leu 
15 10 



acc ttc aga aag gtg ggg ate ccc ate ate ata gca eta ctg age ctg 
Thr Phe Arg Lys Val Gly He Pro He He He Ala Leu Leu Ser Leu 
30 35 40 

geg agt ate ate att gtg gtt gte etc ate aag gtg att ctg gat aaa 
Ala Ser He He He Val Val Val Leu He Lys Val He Leu Asp Lys 
45 50 55 



aag gac cga tec aca ctg cag gtg ctg gac teg gee aca ggg aac tgg 
Lys Asp Arg Ser Thr Leu Gin Val Leu Asp Ser Ala Thr Gly Asn Trp 
110 115 120 



aac age etc gat gte aaa ccc ctg cgc aaa ccc cgt ate ccc atg gag 341 
Asn Ser Leu Asp Val Lys Pro Leu Arg Lys Pro Arg He Pro Met Glu 
15 20 25 



389 



437 



tac tac tte etc tgc ggg cag ect etc cac ttc ate ccg agg aag cag 485 

Tyr Tyr Phe Leu Cys Gly Gin Pro Leu His Phe He Pro Arg Lys Gin 
60 65 70 75 

ctg tgt gac gga gag ctg gac tgt ccc ttg ggg gag gac gag gag cac 533 

Leu Cys Asp Gly Glu Leu Asp Cys Pro Leu Gly Glu Asp Glu Glu His 

80 85 90 

tgt gte aag age ttc ccc gaa ggg cct gca gtg gca gte cgc etc tec 581 

Cys Val Lys Ser Phe Pro Glu Gly Pro Ala Val Ala Val Arg Leu Ser 

95 100 105 



629 
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ttc tct gcc tgt ttc gac aac ttc aca gaa get etc get gag aca gcc 677 
Phe Ser Ala Cys Phe Asp Asn Phe Thr Glu Ala Leu Ala Glu Thr Ala 
125 130 135 

tgt agg cag atg ggc tac age age aaa cce aec tte aga get gtg gag 725 
Cys Arg Gin Met Gly Tyr Ser Ser Lys Pro Thr Phe Arg Ala Val Glu 
140 145 ISO 155 



att ggc cca gac cag gat ctg gat gtt gtt gaa ate aca gaa aac age 
lie Gly Pro Asp Gin Asp Leu Asp Val Val Glu lie Thr Glu Asn Ser 

160 165 170 

cag gag ctt cgc atg egg aac tea agt ggg cec tgt etc tea ggc tec 
Gin Glu Leu Arg Met Arg Asn Ser Ser Gly Pro Cys Leu Ser Gly Ser 
175 180 185 

ctg gtc tec ctg cae tgt ctt gcc tgt ggg aag age ctg aag acc ccc 
Leu Val Ser Leu His Cys Leu Ala Cys Gly Lys Ser Leu Lys Thr Pro 
190 195 200 



gtc age ate cag tac gac ata cag eae gtc tgt gga ggg age ate ctg 
Val Ser lie Gin Tyr Asp He Gin His Val Cys Gly Gly Ser He Leu 
220 225 230 235 



773 



821 



869 



cgt gtg gtg ggt ggg gag gag gcc tct gtg gat tct tgg cct tgg cag 917 
Arg Val Val Gly Gly Glu Glu Ala Ser Val Asp Ser Trp Pro Trp Gin 
205 210 215 



965 



gac ccc cac tgg gtc etc aeg gca gcc cae tgc ttc agg aaa cat ace 1013 
Asp Pro His Trp Val Leu Thr Ala Ala His Cys Phe Arg Lys His Thr 

240 245 250 

gat gtg ttc aac tgg aag gtg egg gca ggc tea gac aaa ctg ggc age 1061 
Asp Val Phe Asn Trp Lys Val Arg Ala Gly Ser Asp Lys Leu Gly Ser 
255 260 265 

ttc cca tec ctg get gtg gee aag ate ate ate att gaa ttc aac ccc 1109 
Phe Pro Ser Leu Ala Val Ala Lys lie He He He Glu Phe Asn Pro 
270 275 280 

atg tac ccc aaa gac aat gae ate gcc etc atg aag ctg cag tte cca 
Met Tyr Pro Lys Asp Asn Asp He Ala Leu Met Lys Leu Gin Phe Pro 
285 290 295 

etc act ttc tea ggc aca gtc agg etc ate tgt ctg ccc ttc ttt gat 
Leu Thr Phe Ser Gly Thr Val Arg Leu He Cys Leu Pro Phe Phe Asp 
300 305 310 315 

gag gag etc act cca gee acc cca etc tgg ate att gga tgg ggc ttt 1253 
Glu Glu Leu Thr Pro Ala Thr Pro Leu Trp He He Gly Trp Gly Phe 

320 325 330 

aeg aag cag aat gga ggg aag atg tct gac ata ctg ctg cag gcg tea 1301 
Thr Lys Gin Asn Gly Gly Lys Met Ser Asp He Leu Leu Gin Ala Ser 
335 340 345 

gtc cag gtc att gac age aca egg tgc aat gca gac gat gcg tac cag 1349 
Val Gin Val He Asp Ser Thr Arg Cys Asn Ala Asp Asp Ala Tyr Gin 
350 355 360 

ggg gaa gtc ace gag aag atg atg tgt gca ggc ate ccg gaa ggg ggt 1397 



1157 



1205 
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Gly Glu Val Thr Glu Lys Met Met Cys Ala Gly lie Pro Glu Gly Gly 
365 370 375 

gtg gac acc tgc cag ggt gac agt ggt ggg ccc ctg atg tac caa tct 1445 
Val Asp Thr Cys Gin Gly Asp Sex Gly Gly Pro Leu Met Tyr Gin Ser 
380 365 390 395 

gac cag tgg cat gtg gtg ggc ate gtt age tgg ggc tat ggc tgc ggg 1493 
Asp Gin Trp His Val Val Gly He Val Ser Trp Gly Tyr Gly Cys Gly 

400 405 410 

ggc ccg age acc cca gga gta tac acc aag gtc tea gee tat etc aac 1541 
Gly Pro Ser Thr Pro Gly Val Tyr Thr Lys Val Ser Ala Tyr Leu Asn 
415 420 425 

tgg ate tac aat gtc tgg aag get gag ctg taa tgctgctgcc cctttgcagt 1594 
Trp He Tyr Asn Val Trp Lys Ala Glu Leu * 
430 435 

gctgggagcc gcttccttcc tgccctgccc acctggggat cccccaaagt cagaeacaga 1654 

gcaagagtcc ccttgggtac acccctctgc ccacagcctc agcatttctt ggagcagcaa 1714 

agggcctcaa ttcctgtaag agaccctegc agcccagagg cgcccagagg aagtcagcag 1774 

ccctagctcg gccacacttg gtgctcccag catcccaggg agagacacag cccactgaac 1834 

aaggtctcag gggtattgct aagccaagaa ggaactttcc cacactactg aatggaagca 1894 

ggctgtcttg taaaagecca gatcactgtg ggctggagag gagaaggaaa gggtctgcgc 1954 

cagccctgtc cgtcttcacc catccccaag ectactagag caagaaacca gttgtaatat 2014 

aaaatgcact gecetactgt tggtatgact accgttacct actgttgtca ttgttattac 2074 

agctatggcc act at tat t a aagagctgtg taaeaaaaaa aaaaaaaaaa aaaaaaaaaa 2134 

aaa 2137 



<210> 4 

<211> 437 

<212> PRT 

<:213> Homo Sapien 



<400> 4 



Met 


Leu 


Gin 


Asp 


Pro 
5 


Asp 


Ser 


Asp 


1 
Lys 


Pro 


Leu 


Arg 


Lys 


Pro 


Arg 


He 






20 










Gly 


He 


Pro 


He 


He 


He 


Ala 


Leu 




35 










40 


Val 


Val 


Val 


Leu 


He 


Lys 


Val 


He 




50 










55 




Gly 


Gin 


Pro 


Leu 


His 


Phe 


He 


Pro 


65 










70 






Leu Asp 


Cys 


Pro 


Leu 


Gly 


Glu 


Asp 










85 








Pro 


Glu 


Gly 


Pro 


Ala 


Val 


Ala 


Val 








100 










Leu 


Gin 


Val 


Leu 


Asp 


Ser 


Ala 


Thr 






115 










120 


Asp 


Asn 


Phe 


Thr 


Glu 


Ala 


Leu 


Ala 


130 










135 




Tyr 


Ser 


Ser 


Lys 


Pro 


Thr 


Phe 


Arg 


145 










150 






Asp 


Leu 


Asp 


Val 


Val 


Glu 


He 


Thr 








165 








Arg 


Asn 


Ser 


Ser 


Gly 


Pro 


Cys 


Leu 






180 










Cys 


Leu 


Ala 


Cys 


Gly 


Lys 


Ser 


Leu 



Gin 


Pro 


Leu 


Asn 


Ser 


Leu Asp 


Val 




10 










15 




Pro 


Met 


Glu 


Thr 


Phe 


Arg 


Lys 


Val 


25 










30 






Leu 


Ser 


Leu 


Ala 


Ser 


He 


He 


He 










45 








Leu 


Asp 


Lys 


Tyr 

c n 


Tyr 


Phe 


Leu 


Cys 


Arg 


Lys 


Gin Leu Cys Asp Gly Glu 




75 










80 


Glu 


Glu 


His 


Cys 


Val 


Lys 


Ser 


Phe 




90 










95 




Arg 


Leu 


Ser 


Lys 


Asp 


Arg 


Ser 


Thr 


105 










110 






Gly 


Asn 


Trp 


Phe 


Ser 


Ala 


Cys 


Phe 








125 








Glu 


Thr 


Ala 


Cys 


Arg 


Gin 


Met 


Gly 








140 










Ala 


Val 


Glu 


He 


Gly 


Pro 


Asp 


Gin 






155 










160 


Glu 


Asn 


Ser 


Gin 


Glu 


Leu 


Arg 


Met 




170 










175 




Ser 


Gly 


Ser 


Leu 


Val 


Ser 


Leu 


His 


185 








190 






Lys 


Thr 


Pro Arg 


Val 


Val 


Gly Gly 
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195 










200 


Glu 


Glu 


Ala 


Ser 


Val 


Asp 


Ser 


Trp 




210 










215 




Asp 


He 


Gin 


His 


Val 


Cys 


Gly Gly 


225 










230 






lieu 


Thr 


Ala 


Ala 


His 


Cys 


Phe 


Arg 










245 








Lys 


Val 


Arg Ala Gly 


Ser 


Asp 


Lys 








260 










Val 


Ala 


Lys 


He 


He 


He 


He 


Glu 






275 










280 


Asn Asp 


He 


Ala 


Leu 


Met 


Lys 


Leu 




290 










295 




Thr 


Val 


Arg Leu 


He 


Cys 


Leu 


Pro 


305 










310 






Ala 


Thr 


Pro 


Leu 


Trp 


He 


He 


Gly 










325 








Gly Lys 


Met 


Ser 


Asp 


He 


Leu 


Leu 








340 










Ser 


Thr 


Arg 


Cys 


Asn 


Ala 


Asp 


Asp 






355 










360 


Lys 


Met 


Met 


Cys 


Ala 


Gly 


He 


Pro 




370 










375 




Gly Asp 


Ser Gly Gly 


Pro 


Leu 


Met 


385 










390 






Val 


Gly 


He 


Val 


Ser 


Trp 


Gly Tyr 










405 








Gly Val 


Tyr 


Thr 


Lys 


Val 


Ser 


Ala 








420 










Trp 


Lys 


Ala 


Glu 


Leu 









435 









205 








Pro 


Trp Gin 


Val 


Ser 


He 


Gin 


Tyr 






220 










Ser 


He Leu 


Asp 


Pro 


His 


Trp 


Val 




235 










240 


Lys 


His Thr 


Asp 


Val 


Phe 


Asn 


Trp 




250 








255 




Leu 


Gly Ser 


Phe 


Pro 


Ser 


Leu 


Ala 


265 






270 






Phe 


Asn Pro 


Met 


Tyr 


Pro 


Lys 


Asp 








285 








Gin 


Phe Pro 


Leu 


Thr 


Phe 


Ser 


Gly 






300 










Phe 


Phe Asp 


Glu 


Glu 


Leu 


Thr 


Pro 




315 










320 


Trp 


Gly Phe 


Thr 


Lys 


Gin 


Asn 


Gly 




330 








335 




Gin 


Ala Ser 


Val 


Gin 


Val 


He 


Asp 


345 








350 






Ala 


Tyr Gin Gly 


Glu 


Val 


Thr 


Glu 








365 








Glu 


Gly Gly Val 


Asp 


Thr 


Cys 


Gin 






380 










Tyr 


Gin Ser Asp 


Gin 


Trp 


His 


Val 


3 95 










400 


Gly 


Cys Gly Gly 


Pro 


Ser 


Thr 


Pro 




410 








415 




Tyr 


Leu Asn Trp 


He 


Tyr 


Asn 


Val 


425 








430 







<210> 5 

<211> 708 

<212> DMA 

<213> Homo Sapien 



<220> 

<221> CDS 

<222> (1) . . . (708) 

<223> Nucleic acid encoding an MTSP4 protease domaan 



<400> 5 

att gtt ggt gga get gtg tec tec gag ggt gag tgg cea tgg cag gcc 

He Val Gly Gly Ala Val Ser Ser Glu Gly Glu Trp Pro Trp Gin Ala 
IS 10 15 

age etc cag gtt egg ggt cga cac ate tgt ggg ggg gee etc ate get 

Ser Leu Gin Val Arg Gly Arg His He Cys Gly Gly Ala Leu He Ala 
20 25 30 



48 



96 



gae egc tgg gtg ata aea get gcc cac tge ttc cag gag gac age atg -144 

Asp Arg Trp Val He Thr Ala Ala His Cys Phe Gin Glu Asp Ser Met 
35 40 45 

gcc tec acg gtg etg tgg aec gtg tte ctg ggc aag gtg tgg eag aae 192 

Ala Ser Thr Val Leu Trp Thr Val Phe Leu Gly Lys Val Trp Gin Asn 
SO 55 60 



teg cge tgg cet gga gag gtg tec tte aag gtg age cge etg etc ctg 240 
Ser Arg Trp Pro Gly Glu Val Ser Phe Lys Val Ser Arg Leu Leu Leu 
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65 70 75 80 

cac ccg tac cac gaa gag gac age cat gac tac gac gtg gcg ctg ctg 2 88 

Hie Pro Tyx His Glu Glu Asp Ser His Asp Tyx Asp Val Ala Leu Leu 

85 90 95 

cag etc gac cac ccg gtg gtg cgc teg gcc gcc gtg cgc ccc gtc tgc 336 

Gin Leu Asp His Pro Val Val Arg Ser Ala Ala Val Arg Pro Val Cys 

1 nc Tin 



100 105 110 

ctg ccc gcg cgc tec cac ttc ttc gag ccc ggc ctg cac tge tgg att 
Leu Pro Ala Arg Ser His Phe Phe Glu Pro Gly Leu His Cys Trp He 
115 120 125 



cag aaa gtg gat gtg cag ttg ate cca cag gac ctg tgc age gag gtc 
Gin Lys Val Asp Val Gin Leu He Pro Gin Asp Leu Cys Ser Glu Val 
145 150 155 160 

tat cgc tac cag gtg acg cca cgc atg ctg tgt gcc ggc tac cgc aag 
Tyr Arg Tyr Gin Val Thr Pro Arg Met Leu Cys Ala Gly Tyr Arg Lys 

165 170 175 



ctg ggc tgt ggc egg ect aac tac ttc ggc gtc tac ace cgc ate aca 
Leu Gly Cys Gly Arg Pro Asn Tyr Phe Gly Val Tyr Thr Arg He Thr 
210 215 220 

ggt gtg ate age tgg ate cag caa gtg gtg ace tga 
Gly Val He Ser Trp He Gin Gin Val Val Thr * 
225 230 235 

<210> 6 

<211> 235 

<212> PRT 

<213> Homo Sapien 

<400> 6 
He Val 
1 



50 

Ser Ari 
65 



Gly 


Gly 


Ala Val Ser 


Ser Glu Gly Glu Trp 


Pro 


Trp 


Gin 


Ala 


5 






10 








15 




Gin 


Val 


Tirg Gly Arg His 


He 


Cys 


Gly Gly Ala 


Leu 


He 


Ala 




20 






25 








30 






Trp 


Val 


He Thr Ala 


Ala 


His 


Cys 


Phe Gin 


Glu 


Asp 


Ser 


Met 


35 






40 








45 








Thr 


Val 


Leu Trp Thr 
55 


Val 


Phe 


Leu 


Gly Lys 
60 


Val 


Trp 


Gin 


Asn 


Trp 


Pro 


Gly Glu Val 


Ser 


Phe 


Lys 


Val Ser 


Arg 


Leu 


Leu 


Leu 




70 








75 








80 


Tyr 


His 


Glu Glu Asp 


Ser 


His 


Asp 


Tyr Asp 


Val 


Ala 


Leu 


Leu 




85 






90 








95 





3 84 



acg ggc tgg ggc gcc ttg cgc gag ggc ggc ccc ate age aac get ctg 432 
Thr Gly Trp Gly Ala Leu Arg Glu Gly Gly Pro He Ser Asn Ala Leu 
130 135 140 



480 



528 



ggc aag aag gat gee tgt cag ggt gac tea ggt ggt ccg ctg gtg tge 576 
Gly Lys Lys Asp Ala Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys 
180 1B5 190 

aag gea etc agt ggc cgc tgg ttc ctg gcg ggg ctg gtc age tgg ggc 624 
Lys Ala Leu Ser Gly Arg Trp Phe Leu Ala Gly Leu Val Ser Trp Gly 
195 200 205 



672 



708 
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Gln Leu Asp His Pro Val Val Arg Ser Ala Ala Val Arg Pro Val Cys 

100 105 110 

Leu Pro Ala Arg Ser His Phe Phe Glu Pro Gly Leu His Cys Trp He 

X15 120 125 

Thr Gly Trp Gly Ala Leu Arg Glu Gly Gly Pro He Ser Asn Ala Leu 

130 135 140 

Gin Lys Val Asp Val Gin Leu He Pro Gin Asp Leu Cys Ser Glu Val 
145 150 155 ISO 

TVr Arq Tyx Gin Val Thr Pro Arg Met Leu Cys Ala Gly -Tyr Arg Lys 

165 170 175 

Gly Lys Lys Asp Ala Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys 

180 185 190 

Lys Ala Leu Ser Gly Arg Trp Phe Leu Ala Gly Leu Val Ser Trp Gly 

195 200 205 

Leu Gly Cys Gly Arg Pro Asn Tyr Phe Gly Val Tyr Thr Arg He Thr 

210 215 220 

Gly Val He Ser Trp He Gin Gin Val Val Thr 
225 230 235 

<210> 7 

<211> 3104 

<212> DMA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> (33) . . . (2441) ^ . . ^ 

<223> Nucleic acid encoding MTSP4-L (long form) splice variant 

tca^cggcca gagggtgatc agtgagcaga ag atg ccc gtg gcc gag gcc ccc 53 



Met Pro Val Ala Glu Ala Pro 
1 5 

cag gtg get ggc ggg cag ggg gac gga ggt gat ggc gag gaa gcg gag 
Gin Val Ala Gly Gly Gin Gly Asp Gly Gly Asp Gly Glu Glu Ala Glu 
10 . 15 20 . 

i 

ccg gag ggg atg ttc aag gcc tgt gag gac tec aag aga aaa gcc egg 
Pro Glu Gly Met Phe Lys Ala Cys Glu Asp Ser Lye Arg Lys Ala Arg 
25 30 35 

ggc tac etc cgc ctg gtg ccc ctg ttt gtg ctg ctg gcc ctg etc gtg 
gj ^ Leu Arg Leu Val Pro Leu Phe Val Leu Leu Ala Leu Leu Val 
40 45 50 55 

ctQ get teg gcg ggg gtg eta etc tgg tat ttc eta ggg tac aag gcg 
Leu Ala Se? Ala Gly Val Leu Leu Trp Tyr Phe Leu Gly Tyr Lys Ala 

60 65 70 

qag gtg atg gte age cag gtg tac tea ggc agt ctg cgt gta etc aat 
Glu Val Met Val Ser Gin Val Tyr Ser Gly Ser Leu Arg Val Leu Asn 
75 80 85 

cgc cac ttc tee cag gat ctt ace cgc egg gaa tct agt gcc ttc cgc 
Arci His Phe Ser Gin Asp Leu Thr Arg Arg Glu Ser Ser Ala Phe Arg 
^ 90 95 100 

agt gaa ace gee aaa gee cag aag atg etc aag gag etc ate ace age 
Ser Glu Thr Ala Lys Ala Gin Lys Met Leu Lys Glu Leu He Thr Ser 
105 110 lis 



101 



149 



197 



245 



293 



341 



389 
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acc cgc ctg gga act tac tac aac tec age tec gte tat tec ttt ggg 437 
Thr Arg Leu Gly Thr Tyr Tyr Asn Ser Ser Ser Val Tyr Ser Phe Gly 
120 125 130 135 

gag gga ccc etc acc tgc ttc ttc tgg ttc att etc caa ate ccc gag 485 
Glu Gly Pro Leu Thr Cys Phe Phe Trp Phe He Leu Gin He Pro Glu 

140 145 150 

cac cgc egg ctg atg ctg age ccc gag gtg gtg cag gca ctg ctg gtg 533 
His Arg Arg Leu Met Leu Ser Pro Glu Val Val Gin Ala Leu Leu Val 
155 160 165 

gag gag ctg ctg tec aea gte aac age teg get gee gte ccc tac agg 581 
Glu Glu Leu Leu Ser Thr Val Asn Ser Ser Ala Ala Val Pro Tyr Arg 
170 175 180 



gee gag tac gaa gtg gac ccc gag gge eta gtg ate ctg gaa gcc agt 
Ala Glu Tyr Glu Val Asp Pro Glu Gly Leu Val He Leu Glu Ala Ser 
185 190 195 

gtg aaa gac at a get gca ttg aat tec acg ctg ggt tgt tac cgc tac 
Val Lys Asp He Ala Ala Leu Asn Ser Thr Leu Gly Cys Tyr Arg Tyr 
200 205 210 215 



ggg gcc ate atg geg gte gte tgg aag aag gge ctg cae age tac tac 
Gly Ala He Met Ala Val Val Trp Lys Lys Gly Leu His Ser Tyr Tyr 

300 305 310 

gac ccc ttc gtg etc tec gtg cag ccg gtg gte ttc eag gcc tgt gaa 
Asp Pro Phe Val Leu Ser Val Gin Pro Val Val Phe Gin Ala Cys Glu 
315 320 325 

gtg aac ctg aeg ctg gae aac agg etc gac tec eag gge gte etc age 
Val Asn Leu Thr Leu Asp Asn Arg Leu Asp Ser Gin Gly Val Leu Ser 
330 335 340 

acc ccg tac ttc ccc age tac tac teg ccc caa acc cac tgc tec tgg 
Thr Pro Tyr Phe Pro Ser Tyr Tyr Ser Pro Gin Thr His Cys Ser Trp 
345 350 355 



629 



677 



age tac gtg gge cag gge cag gte etc egg ctg aag ggg cct gae eac 725 
Ser Tyr Val Gly Gin Gly Gin Val Leu Arg Leu Lys Gly Pro Asp His 

220 225 ' 230 

ctg gcc tec age tgc ctg tgg cac ctg cag gge ccc aag gac etc atg 773 
Leu Ala Ser Ser Cys Leu Trp His Leu Gin Gly Pro Lys Asp Leu Met 
235 240 245 

etc aaa etc egg ctg gag tgg acg ctg gca gag tgc egg gac cga ctg 821 
Leu Lys Leu Arg Leu Glu Trp Thr Leu Ala Glu Cys Arg Asp Arg Leu 
250 255 260 

gee atg tat gac gtg gcc ggg ccc ctg gag aag agg etc ate acc teg 869 
Ala Met Tyr Asp Val Ala Gly Pro Leu Glu Lys Arg Leu He Thr Ser 
265 270 275 

gtg tac gge tgc age cgc cag gag ccc gtg gtg gag gtt ctg gcg teg 917 
Val Tyr Gly Cys Ser Arg Gin Glu Pro Val Val Glu Val Leu Ala Ser 
280 285 290 295 



965 



1013 



1061 



1109 
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cac etc acg gtg ccc tct ctg gac tac ggc ttg gcc etc tgg ttt gat 1157 
His Leu Thr Val Pro Ser Leu Asp Tyr Gly Leu Ala Leu Trp Phe Asp 
3G0 365 370 375 

gcc tat gca ctg agg agg cag aag tat gat ttg ccg tgc acc cag ggc 1205 
Ala Tyr Ala Leu Arg Arg Gin Lys Tyr Asp Leu Pro Cya Thr Gin Gly 

380 385 390 

cag tgg acg ate cag aac agg agg ctg tgt ggc ttg cgc ate ctg cag 1253 
Gin Trp Thr lie Gin Asn Arg Arg Leu Cys Gly Leu Arg lie Leu Gin 
395 400 405 

ccc tac gcc gag agg ate ccc gtg gtg gcc acg gcc ggg ate acc ate 1301 
Pro Tyr Ala Glu Arg lie Pro Val Val Ala Thr Ala Gly lie Thr lie 
410 415 420 

aac ttc acc tec cag ate tec etc acc ggg ccc ggt gtg egg gtg cac 134 9 

Asn Phe Thr Ser Gin lie Ser Leu Thr Gly Pro Gly Val Arg Val His 
425 430 435 

tat ggc ttg tac aac cag teg gac ccc tgc cct gga gag ttc etc tgt 1397 
Tyr Gly Leu Tyr Asn Gin Ser Asp Pro Cys Pro Gly Glu Phe Leu Cys 
440 445 450 455 

tct gtg aat gga etc tgt gtc cct gcc tgt gat ggg gtc aag gac tgc 1445 
Ser Val Asn Gly Leu Cys Val Pro Ala Cys Asp Gly Val Lys Asp Cys 

460 465 470 

ccc aac ggc ctg gat gag aga aac tgc gtt tgc aga gcc aca ttc cag 1493 
Pro Asn Gly Leu Asp Glu Arg Asn Cys Val Cys Arg Ala Thr Phe Gin 
475 480 485 

tgc aaa gag gac age aca tgc ate tea ctg ccc aag gtc tgt gat ggg 1541 
Cys Lys Glu Asp Ser Thr Cys lie Ser Leu Pro Lys Val Cys Asp Gly 
490 495 500 

cag cct gat tgt etc aac ggc age gac gaa gag cag tgc cag gaa ggg 1589 
Gin Pro Asp Cys Leu Asn Gly Ser Asp Glu Glu Gin Cys Gin Glu Gly 
505 510 515 

gtg cca tgt ggg aca ttc ace ttc cag tgt gag gac egg age tgc gtg 1637 
Val Pro Cys Gly Thr Phe Thr Phe Gin Cys Glu Asp Arg Ser Cys Val 
520 525 530 535 

aag aag ccc aac ccg cag tgt gat ggg egg ccc gac tgc agg gac ggc 1685 
Lys Lys Pro Asn Pro Gin Cys Asp Gly Arg Pro Asp Cys Arg Asp Gly 

540 545 550 

teg gat gag gag eac tgt gaa tgt ggc etc cag ggc ccc tec age cgc 1733 
Ser Asp Glu Glu His Cys Glu Cys Gly Leu Gin Gly Pro Ser Ser Arg 
555 560 565 

att gtt ggt gga get gtg tec tee gag ggt gag tgg cca tgg cag gee 1781 
He Val Gly Gly Ala Val Ser Ser Glu Gly Glu Trp Pro Trp Gin Ala 
570 575 580 

age etc cag gtt egg ggt cga cac ate tgt ggg ggg gcc etc ate get 1829 
Ser Leu Gin Val Arg Gly Arg His lie Cys Gly Gly Ala Leu He Ala 
585 590 595 

gac cgc tgg gtg ata aca get gcc cac tgc ttc cag gag gac age atg 1877 
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Asp Arg Trp Val lie Thr Ala Ala His Cys Phe Gin Glu Asp Ser Met 
600 605 610 615 

gcc tec acg gtg ctg tgg acc gtg ttc ctg ggc aag gtg tgg cag aac 1925 
Ala Ser Thr Val Leu Trp Thr Val Phe Leu Gly Lys Val Trp Gin Asn 

620 625 630 

teg cgc tgg cct gga gag gtg tec ttc aag gtg age cge ctg etc ctg 1973 
Ser Arg Trp Pro Gly Glu Val Ser Phe Lys Val Ser Arg Leu Leu Leu 
635 640 645 

cac ccg tac cac gaa gag gac age cat gac tac gae gtg gcg ctg ctg 2021 
His Pro Tyr His Glu Glu Asp Ser His Asp Tyr Asp Val Ala Leu Leu 
650 655 660 

cag etc gac cac ccg gtg gtg cgc teg gcc gee gtg cgc cee gtc tgc 2069 
Gin Leu Asp His Pro Val Val Arg Ser Ala Ala Val Arg Pro Val Cys 
665 670 675 

ctg cce gcg cge tee cac ttc tte gag ecc ggc ctg cac tgc tgg att 2117 
Leu Pro Ala Arg Ser His Phe Phe Glu Pro Gly Leu His Cys Trp lie 
680 685 690 695 

acg ggc tgg ggc gee ttg cgc gag ggc ggc ecc ate age aac get ctg 2165 
Thr Gly Trp Gly Ala Leu Arg Glu Gly Gly Pro lie Ser Asn Ala Leu 

700 705 710 

cag aaa gtg gat gtg cag ttg ate cca cag gac ctg tgc age gag gtc 2213 
Gin Lys Val Asp Val Gin Leu lie Pro Gin Asp I*eu Cys Ser Glu Val 
715 720 725 

tat cgc tac cag gtg aeg cca cgc atg ctg tgt gcc ggc tac cgc aag 2261 
Tyr Arg Tyr Gin Val Thr Pro Arg Met Leu Cys Ala Gly Tyr Arg Lys 
730 735 740 

ggc aag aag gat gcc tgt cag ggt gae tea ggt ggt ccg ctg gtg tgc 2309 
Gly Lys Lys Asp Ala Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys 
745 750 755 

aag gca etc agt ggc cgc tgg ttc ctg gcg ggg ctg gtc age tgg ggc 2357 
Lys Ala Leu Ser Gly Arg Trp Phe Leu Ala Gly Leu Val Ser Trp Gly 
760 765 770 775 

Ctg ggc tgt ggc egg cct aac tac ttc ggc gtc tac acc cgc ate aea 2405 
Leu Gly Cys Gly Arg Pro Asn Tyr Phe Gly Val Tyr Thr Arg lie Thr 

780 785 790 

ggt gtg ate age tgg ate cag caa gtg gtg acc tga ggaactgccc 2451 
Gly Val lie Ser Trp lie Gin Gin Val Val Thr * 
795 800 

ccctgcaaag cagggcccac ctcctggact cagagagcce agggcaactg ccaagcaggg 2511 

ggacaagtat tctggcgggg ggtgggggag agagcaggcc ctgtggtggc aggaggggca 2571 

tcttgtttcg tccctgatgt ctgtccagta tggcaggagg atgagaagtg ccagcagttg 2631 

ggggtcaaga cgtcccttga ggaeccaggc ccaeacccag cccttttgcc tcccaattct 2691 

ctctcctecg tccccttcet ccactgctge ctaatgcaag gcagtggcte agcagcaaga 2751 

atgctggttc tacatcccga ggagtgtctg aggtgcgccc eactctgtae agaggctgtt 2 811 

tgggcagcct tgcetccaga gageagatte eagcttegga agcccctggt ctaacttggg 2871 

atctgggaat ggaaggtgct cccateggag gggaccctca gagccctgga gactgecagg 2931 

tgggcctgct gccactgtaa gccaaaaggt ggggaagtcc tgactccagg gtccttgccc 2991 

caccectgcc tgccaectgg gccctcacag cccagaccct cactgggagg tgagctcagc 3051 
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tgccctttgg aataaagctg cctgatgcaa aaaaaaaaaa aaaaaaaaaa aaa 3104 

<210> 8 

<211> 802 

<212> PRT 

<213> Homo Sapien 



<400> 8 



Met 


Pro 


Val 


Ala 


Glu 


Ala 


Pro 


Gin 


Val 


Ala 


Gly Gly 


Gin 


Gly 


Asp 


Gly 


1 








5 










10 










lb 




Gly Asp 


Gly 


Glu 


Glu 


Ala 


Glu 


Pro 


Glu 


Gly 


Met 


Phe 


Lys 


Ala 


^^w _ — 

Cys 


Glu 






20 










25 










jO 






Asp 


Ser 


Lys 
35 


Arg 


Lys 


Ala 


Arg 


Gly 
40 


Tyr 


Leu 


Arg 


Leu 


Val 

A C 


Pro 


Leu 


Pne 


Val 


Leu 


Leu 


Ala 


Leu 


Leu 


Val 


Leu 


Ala 


Ser 


Ala Gly 


Val 


Leu 


Leu 


Trp 




50 










55 










60 










Tyr 


Phe 


Leu 


Gly 


Tyr 


Lys 


Ala 


Glu 


Val 


Met 


Val 


Ser 


Gin 


Val 


Tyr 


Ser 


65 






70 










75 










80 


Gly 


Ser 


Leu 


Arg 


Val 


Leu 


Asn 


Arg 


His 


Phe 


Ser 


Gin 


Asp 


Leu 


Thr 


Arg 






85 










90 










95 




Arg 


Glu 


Ser 


Ser 


Ala 


Phe 


Arg 


Ser 


Glu 


Thr 


Ala 


Lys 


Ala 


Gin 


Lys 


Met 






100 










105 










110 






Leu 


Lys 


Glu 


Leu 


He 


Thr 


Ser 


Thr 


Arg 


Leu 


Gly Thr 


Tyr 


Tyr 


Asn 


Ser 




115 










12 0 










125 








Ser 


Ser 


Val 


Tyr 


Ser 


Phe 


Gly 


Glu 


Gly 


Pro 


Leu 


Thr 


Cys 


Phe 


w>l_ 

Phe 


Trp 




130 








135 










140 










Phe 


He 


Leu 


Gin 


He 


Pro 


Glu 


WW • 

His 


Arg 


Arg 


Leu 


Met 


Leu 


Ser 


Pro 


Glu 


145 










150 










155 










loU 


Val 


Val 


Gin 


Ala 


Leu 
165 


Leu 


Val 


Glu 


Glu 


Leu 
170 


Leu 


Ser 


Thr 


Val 


Asn 
175 


Ser 


Ser 


Ala 


Ala 


Val 


Pro 


Tyr 


Arg 


Ala 


Glu 


Tyr 


Glu 


Val 


Asp 


Pro 


Glu 


Gly 








180 








185 










190 






Leu 


Val 


He 
195 


Leu 


Glu 


Ala 


Ser 


Val 
200 


Lys 


Asp 


He 


Ala 


Ala 
205 


Leu 


Asn 


Ser 


Thr 


Leu 


Gly 


Cys 


Tyr 


Arg 


Tyr 


Ser 


Tyr 


Val 


Gly Gin 


Gly 


Gin 


Val 


Leu 




210 










215 










220 










Arg 


Leu 


Lys 


Gly 


Pro 


Asp 


His 


Leu 


Ala 


Ser 


Ser 


Cys 


Leu 


Trp 


His 


Leu 


225 






230 










235 










240 


Gin Gly 


Pro 


Lys 


Asp 


Leu 


Met 


Leu 


Lys 


Leu 


Arg 


Leu 


Glu 


Trp 


Thr 


Leu 








245 










250 










255 




Ala 


Glu 


Cys 


Arg 
260 


Asp 


Arg 


Leu 


Ala 


Met 
265 


Tyr 


Asp 


Val 


Ala 


Gly 
270 


Pro 


Leu 


Glu 


Lys 


Arg 
275 


Leu 


He 


Thr 


Ser 


Val 
280 


Tyr 


Gly 


Cys 


Ser 


Arg 
285 


Gin 


Glu 


Pro 


Val 


Val 
290 


Glu 


Val 


Leu 


Ala 


Ser 
295 


Gly 


Ala 


He 


Met 


Ala 
300 


Val 


Val 


Trp 


Lys 


Lys 


Gly 


Leu 


His 


Ser 


Tyr 


Tyr 


Asp 


Pro 


Phe 


Val 


Leu 


Ser 


Val 


Gin 


Pro 


305 










310 










31S 










320 


Val 


Val 


Phe 


Gin 


Ala 
325 


Cys 


Glu 


Val 


Asn 


Leu 
330 


Thr 


Leu 


Asp 


Asn 


Arg 
335 


Leu 


Asp 


Ser 


Gin 


Gly 


Val 


Leu 


Ser 


Thr 


Pro 


Tyr 


Phe 


Pro 


Ser 


Tyr 


Tyr 


Ser 






340 










345 










350 






Pro 


Gin 


Thr 


His 


Cys 


Ser 


Trp 


His 


Leu 


Thr 


Val 


Pro 


Ser 


Leu 


Asp 


Tyr 






355 








360 










365 








Gly 


Leu 
370 


Ala 


Leu 


Trp 


Phe 


Asp 
375 


Ala 


Tyr 


Ala 


Leu 


Arg 
380 


Arg 


Gin 


Lys 


Tyr 


Asp 


Leu 


Pro 


Cys 


Thr 


Gin 


Gly 


Gin 


Trp 


Thr 


He 


Gin 


Asn 


Arg 


Arg 


Leu 


385 








390 










395 










400 


Cfys 


Gly 


Leu 


Arg 


He 


Leu 


Gin 


Pro 


Tyr 


Ala 


Glu Arg 


He 


Pro 


Val 


Val 






405 










410 










415 




Ala 


Thr 


Ala 


Gly 


He 


Thr 


He 


Asn 


Phe 


Thr 


Ser 


Gin 


He 


Ser 


Leu 


Thr 
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420 



Glv 


Pro 

f 't— w 


Gly Val 


Arg Val 


His 


Tyr 






435 










440 


v-ya 


f ^ w 


Gly Glu 


Phe 


Leu 


Cys 


Ser 




A t; n 










455 




Cys 


nop 


Gly Val 


Lys 


Asp 


Cys 


Pro 


4 OS 










470 






WAX 




Arg 


Ala 


Thr 


Phe 


Gin 


Cys 










485 








Leu 




Lys 


Val 


Cys 


Asp 


Gly Gin 








500 












fin n 


Gin 


Cys 


Gin 


Glu 


Gly Val 






515 










52 0 






Asp Arg 


Ser 


Cys 


Val 


Lys 




con 










535 




Arg 


irXO 


Asp 


Cys 


Arg Asp 


Gly Ser 












550 






liCU 




Gly 


Pro 


Ser 


Ser 


Arg 


He 










565 












Trp 


Pro 


Trp 


Gin 


Ala 


Ser 








560 










Cys 




Gly Ala 


Leu 


He 


Ala Asp 






595 










600 


CyB 


Pne 


Gin 


Glu 


Asp 


Ser 


Met 


Ala 


610 










615 




Lieu 


Gly 


Lys 


Val 


Trp 


Gin 


Asn 


Ser 


625 










630 






Iiys 


vax 


Ser Arg 


Leu 


Leu 


Leu 


His 








645 








Asp 


Tyr 


Asp 


Val 


Ala 


Leu 


Leu 


Gin 








660 










Ala 


Ala 


Val 


Arg 


Pro 


Val 


Cys 


Leu 






675 










680 


Pro 


Gly 


Leu 


His 


Cys 


Trp 


He 


Thr 




690 










695 




Gly 


Pro 


He 


Ser 


Asn 


Ala 


Leu 


Gin 


705 










710 






Gin 


Asp 


Leu 


Cys 


Ser 


Glu 


Val 


Tyr 










725 








Leu 


Cys 


Ala 


Gly 


Tyr 


Arg 


Lys 


Gly 








740 










Ser 


Gly 


Gly 


Pro 


Leu 


Val 


Cys 


Lys 






755 










760 


Ala 


Gly 


Leu 


Val 


Ser 


Trp 


Gly Leu 




770 










775 




Gly 


Val 


Tyr 


Thr 


Arg 


He 


Thr Gly 


785 










790 






Val 


Thr 















425 






430 






Gly 


Leu 


Tyr Asn Gin 


Ser 


Asp 


Pro 






445 








Val 


Asn 


Gly Leu Cys 


Val 


Pro 


Ala 






460 








Asn Gly Leu Asp Glu 


Arg 


Asn 


Cys 






475 






480 


Lys 


Glu Asp Ser Thr 


Cys 


He 


Ser 




490 






495 




Pro 


Asp 


Cys Leu Asn 


Gly 


Ser 


Asp 


505 






510 






Pro 


Cys 


Gly Thr Phe 


Thr 


Phe 


Gin 






525 








Lys 


Pro 


Asn Pro Gin 


Cys 


Asp Gly 






3 540 








Asp 


Glu 


Glu His Cys 


Glu 


Cys 


Gly 






555 






560 


Val 


Gly Gly Ala Val 


Ser 


Ser 


Glu 




570 






575 




Leu Gin Val Arg Gly 


Arg 


His 


He 


585 






590 






Arg 


Trp 


Val He Thr 


Ala 


Ala 


His 




605 








Ser 


Thr 


Val Leu Trp 


Thr 


Val 


Phe 






620 








Arg 


Trp 


Pro Gly Glu 


Val 


Ser 


Phe 






635 






640 


Pro 


Tyr 


His Glu Glu 


Asp 


Ser 


His 




650 






655 




Leu 


Asp 


His Pro Val 


Val 


Arg 


Ser 


665 






670 






Pro Ala Arg Ser His 


Phe 


Phe 


Glu 






685 








Gly Trp Gly Ala Leu 


Arg 


Glu Gly 






700 








Lys 


Val 


Asp Val Gin 


Leu 


He 


Pro 




715 






72 0. 


Arg 


Tyr 


Gin Val Thr 


Pro Arg 


Met 




730 






735 




Lys 


Lys 


Asp Ala Cys 


Gin 


Gly Asp 


745 






750 






Ala 


Leu 


Ser Gly Arg 


Trp 


Phe 


Leu 






765 








Gly 


Cys 


Gly Arg Pro 


Asn 


Tyr 


Phe 






780 








Val 


He 


Ser Trp He 


Gin 


Gin 


Val 






795 






800 



<210> 9 

<211> 2672 

<212> DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> (33) . . - (2009) 

<223> cDNA encoding: MTSP4-S (short form) splice variant 



<400> 9 
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tcatcggcca gagggtgatc agtgagcaga ag atg ccc gtg gcc gag gcc ccc 53 

Met Pro Val Ala Glu Ala Pro 
1 5 

cag gtg get ggc ggg cag ggg gac gga ggt gat ggc gag gaa gcg gag 101 
Gin Val Ala Gly Gly Gin Gly Asp Gly Gly Asp Gly Glu Glu Ala Glu 
10 15 20 

ccg gag ggg atg ttc aag gcc tgt gag gac tec aag aga aaa gcc egg 149 
Pro Glu Gly Met Phe Lys Ala Cys Glu Asp Ser Lys Arg Lys Ala Arg 
25 30 35 



ggc tac etc cgc ctg gtg ccc ctg ttt gtg ctg ctg gcc ctg etc gtg 
Gly Tyx Leu Arg Leu Val Pro Leu Phe Val Leu Leu Ala Leu Leu Val 
40 45 50 55 



gtg aac ctg acg ctg gac aac agg etc gac tec cag ggc gtc etc age 
Val Asn Leu Thr Leu Asp Asn Arg Leu Asp Ser Gin Gly Val Leu Ser 
185 190 195 

ace ccg tac ttc ccc age tac tac teg ccc caa ace cae tgc tec tgg 
Thr Pro Tyr Phe Pro Ser Tyr Tyr Ser Pro Gin Thr His Cys Ser Trp 
200 205 210 215 



197 



ctg get teg gcg ggg gtg eta etc tgg tat ttc eta ggg tac aag gcg 245 
Leu Ala Ser Ala Gly Val Leu Leu Trp Tyr Phe Leu Gly Tyr Lys Ala 

60 65 70 

gag gtg atg gtc age cag gtg tac tea ggc agt ctg cgt gta etc aat 293 
Glu Val Met Val Ser Gin Val Tyr Ser Gly Ser Leu Arg Val Leu Asn 
75 80 85 

cgc cac ttc tec cag gat ctt acc cgc egg gaa tct agt gcc ttc cgc 341 
Arg His Phe Ser Gin Asp Leu Thr Arg Arg Glu Ser Ser Ala Phe Arg 
90 95 100 

agt gaa acc gcc aaa gcc cag aag atg etc aag gag etc ate acc age 
Ser Glu Thr Ala Lys Ala Gin Lys Met Leu Lys Glu Leu lie Thr Ser 
105 110 115 



389 



acc cgc ctg gga act tac tac aac tec age tee gtc tat tec ttt ggg 437 
Thr Arg Leu Gly Thr Tyr Tyr Asn Ser Ser Ser Val Tyr Ser Phe Gly 
120 125 130 135 

gtg tac ggc tgc age cgc cag gag ccc gtg gtg gag gtt ctg gcg teg 485 
Val Tyr Gly Cys Ser Arg Gin Glu Pro Val Val Glu Val Leu Ala Ser 

140 145 150 

ggg gcc ate atg gcg gtc gtc tgg aag aag ggc ctg cac age tac tac 533 
Gly Ala lie Met Ala Val Val Trp Lys Lys Gly Leu His Ser Tyr Tyr 
155 160 165 

gac ccc ttc gtg etc tec gtg cag ccg gtg gtc ttc cag gcc tgt gaa 581 
Asp Pro Phe Val Leu Ser Val Gin Pro Val Val Phe Gin Ala Cys Glu 
170 175 180 



629 



677 



eac etc acg gtg ccc tct ctg gac tac ggc ttg gcc etc tgg ttt gat 725 

His Leu Thr Val Pro Ser Leu Asp Tyr Gly Leu Ala Leu Trp Phe Asp 

220 225 230 

gcc tat gca ctg agg agg cag aag tat gat ttg ccg tgc acc cag ggc 773 
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Ala Tyr Ala Leu Arg Arg Gin Lys Tyr Asp Leu Pro Cys Thr Gin Gly 
235 240 245 

•cag tgg acg ate cag aac agg agg ctg tgt ggc ttg cgc ate ctg cag 821 
Gin Trp Thr lie Gin Asn Arg Arg Leu Cys Gly Leu Arg lie Leu Gin 
250 255 260 



ccc tac gcc gag agg ate ccc gtg gtg gcc acg gee ggg ate acc ate 
Pro Tyr Ala Glu Arg lie Pro Val Val Ala Thr Ala Gly lie Thr He 
265 270 275 



869 



aac ttc acc tec cag ate tec cte ace ggg ccc ggt gtg egg gtg cac 917 
Asn Phe Thr Ser Gin He Ser Leu Thr Gly Pro Gly Val Arg Val His 
280 285 290 295 

tat ggc ttg tac aac cag teg gac ccc tgc cct gga gag ttc etc tgt 965 
Tvr Glv Leu Tyr Asn Gin Ser Asp Pro Cys Pro Gly Glu Phe Leu Cys 

300 305 310 

tct gtg aat gga etc tgt gtc cct gee tgt gat ggg gtc aag gac tgc 1013 
Ser Val Asn Gly Leu Cys Val Pro Ala Cys Asp Gly Val Lys Asp Cys 
315 320 325 

ccc aac ggc ctg gat gag aga aac tgc gtt tgc aga gcc aca ttc cag 1061 
Pro Asn Gly Leu Asp Glu Arg Asn Cys Val Cys Arg Ala Thr Phe Gin 
330 335 340 

tgc aaa gag gac age aca tgc ate tea ctg ccc aag gtc tgt gat ggg 1109 
Cvs Lvs Glu Asp Ser Thr Cys He Ser Leu Pro Lys Val Cys Asp Gly 
345 350 355 

cag cct gat tgt etc aac ggc age gac gaa gag cag tgc cag gaa ggg 1157 
Gin Pro Asp Cys Leu Asn Gly Ser Asp Glu Glu Gin Cys Gin Glu Gly 
360 365 370 375 

gtg cca tgt ggg aca ttc acc ttc cag tgt gag gac egg age tgc gtg 1205 
Val Pro Cys Gly Thr Phe Thr Phe Gin Cys Glu Asp Arg Ser Cys Val 

380 385 390 

aag aag ccc aac cog cag tgt gat ggg egg ccc gac tgc agg gac ggc 1253 
Lys Lys Pro Asn Pro Gin Cys Asp Gly Arg Pro Asp Cys Arg Asp Gly 
395 400 405 

teg gat gag gag cac tgt gaa tgt ggc etc cag ggc ccc tec age cgc 13 01 

Ser Asp Glu Glu His Cys Glu Cys Gly Leu Gin Gly Pro Ser Ser Arg 
410 415 420 

att gtt ggt gga get gtg tec tec gag ggt gag tgg cca tgg cag gcc 1349 
He Val Gly Gly Ala Val Ser Ser Glu Gly Glu Trp Pro Trp Gin Ala 
425 430 435 

age etc cag gtt egg ggt cga cac ate tgt ggg ggg gcc etc ate get 1397 
Ser Leu Gin Val Arg Gly Arg His He Cys Gly Gly Ala Leu He Ala 
440 445 450 455 

gac cgc tgg gtg at a aca get gcc cac tgc ttc cag gag gac age atg 1445 
Asp Arg Trp Val He Thr Ala Ala His Cys Phe Gin Glu Asp Ser Met 

460 465 470 

gcc tec acg gtg ctg tgg acc gtg ttc ctg ggc aag gtg tgg cag aac 1493 
Ala Ser Thr Val Leu Trp Thr Val Phe Leu Gly Lys Val Trp Gin Asn 
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475 



480 



485 



teg cgc tgg cct gga gag gtg tec ttc aag gtg age cgc ctg etc ctg 
Ser Arg Trp Pro Gly Glu Val Ser Phe Lys Val Ser Arg lieu Leu Leu 
490 495 500 

cac ccg tac eac gaa gag gac age cat gac tae gac gtg gcg ctg ctg 
His Pro Tyr His Glu Glu Asp Ser His Asp Tyr Asp Val Ala Leu Leu 
505 510 515 

cag etc gac cac ccg gtg gtg cgc teg gee gee gtg cgc cce gtc tgc 
Gin Leu Asp His Pro Val Val Arg Ser Ala Ala Val Arg Pro Val Cys 
520 525 530 535 

Ctg ccc gcg cgc tec cac ttc ttc gag ccc ggc ctg cac tgc tgg att 
Leu Pro Ala Arg Ser His Phe Phe Glu Pro Gly Leu His Cys Trp He 

540 545 550 

acg ggc tgg ggc gee ttg cgc gag ggc ggc ccc ate age aac get ctg 
Thr Gly Trp Gly Ala Leu Arg Glu Gly Gly Pro He Ser Asn Ala Leu 
555 550 565 

cag aaa gtg gat gtg cag ttg ate cca cag gac ctg tgc age gag gtc 
Gin Lys Val Asp Val Gin Leu He Pro Gin Asp Leu Cys Ser Glu Val 
570 575. 580 

tat cgc tac cag gtg acg cca cgc atg ctg tgt gee ggc tac cgc aag 
Tyr Arg Tyr Gin Val Thr Pro Arg Met Leu Cys Ala Gly Tyr Arg Lys 
585 590 595 

ggc aag aag gat gee tgt cag ggt gac tea ggt ggt ccg ctg gtg tgc 
Gly Lys Lys Asp Ala Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys 
600 605 610 615 

aag gca etc agt ggc cgc tgg ttc ctg gcg ggg ctg gtc age tgg ggc 
LVB Ala Leu Ser Gly Arg Trp Phe Leu Ala Gly Leu Val Ser Trp Gly 

620 625 630 

ctg ggc tgt ggc egg cct aac tac ttc ggc gtc tac ace cgc ate aca 
Leu Gly Cys Gly Arg Pro Asn Tyr Phe Gly Val Tyr Thr Arg He Thr 
635 640 645 

ggt gtg ate age tgg ate cag caa gtg gtg ace tga ggaaetgccc 
Gly Val He Ser Trp He Gin Gin Val Val Thr * 
650 655 



ccctgcaaag 
ggacaagtat 
tcttgtttcg 
ggggtcaaga 
ctctcctceg 
atgctggttc 
tgggcagcct 
atctgggaat 
tgggcctgct 
cacccctgee 
tgccctttgg 

<210> 10 
<211> 658 
<212> PRT 



cagggcecac 
tctggegggg 
tecctgatgt 
egtcccttga 
tecccttcct 
tacateccga 
tgectccaga 
ggaaggtgct 
gccactgtaa 
tgc cac c tgg 
aataaagctg 



ctectggaet 
ggtgggggag 
ctgtceagta 
ggacccagge 
ccactgctge 
ggagtgtctg 
gagcagattc 
cecatcggag 
geeaaaaggt 
geccteacag 
cctgatgcaa 



cagagagccc 
agagcaggcc 
tggeaggagg 
ccaeacccag 
ctaatgeaag 
aggtgcgccc 
cagcttegga 
gggaccctca 
ggggaagtec 
cccagacect 
aaaaaaaaaa 



agggcaactg 
ctgtggtgge 
atgagaagtg 
cccttttgee 
gcagtggcte 
cactctgtac 
agcccctggt 
gagcectgga 
tgactccagg 
caetgggagg 
aaaaaaaaaa 



ccaagcaggg 
aggaggggca 
ccagcagttg 
tcccaattet 
agcagcaaga 
agaggctgtt 
ctaacttggg 
gactgccagg 
gtecttgcec 
tgagctcagc 
aaa 



1541 



1589 



1637 



1685 



1733 



1781 



1B29 



1877 



1925 



1973 



2019 



2079 
2139 
2199 
2259 
2319 
2379 
2439 
2499 
2559 
2619 
2672 
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<;213> Homo Sapien 
<400> 10 

Met Pro Val Ala Glu Ala Pro Gin Val Ala Gly Gly Gin Gly Asp Gly 

15 10 15 

Gly Asp Gly Glu Glu Ala Glu Pro Glu Gly Met Phe Lys Ala Cys Glu 

20 25 30 

Asp Ser Lys Arg Lys Ala Arg Gly Tyr Leu Arg Leu Val Pro Leu Phe 

35 40 45 

Val Leu Leu Ala Leu Leu Val Leu Ala Ser Ala Gly Val Leu Leu Trp 

50 55 60 

Tyr Phe Leu Gly Tyr Lys Ala Glu Val Met Val Ser Gin Val Tyr Ser 
65 70 75 80 

Gly Ser Leu Arg Val Leu Asn Arg His Phe Ser Gin Asp Leu Thr Arg 

B5 90 95 

Arg Glu Ser Ser Ala Phe Arg Ser Glu Thr Ala Lys Ala Gin Lys Met 

lOO 105 110 

Leu Lye Glu Leu lie Thr Ser Thr Arg Leu Gly Thr Tyr Tyr Asn Ser 

115 120 125 

Ser Ser Val Tyr Ser Phe Gly Val Tyr Gly Cys Ser Arg Gin Glu Pro 

130 135 140 

Val val Glu Val Leu Ala Ser Gly Ala He Met Ma Val Val Trp Lys 
145 150 155 160 

Lvs Glv Leu His Ser Tyr Tyr Asp Pro Phe Val Leu Ser Val Gin Pro 

165 170 175 

Val Val Phe Gin Ala Cys Glu Val Asn Leu Thr Leu Asp Asn Arg Leu 

IBO 1B5 190 

Asp Ser Gin Gly Val Leu Ser Thr Pro Tyr Phe Pro Ser Tyr Tyr Ser 

195 200 205 

Pro Gin Thr His Cys Ser Trp His Leu Thr Val Pro Ser Leu Asp Tyr 

210 215 220 

Glv Leu Ala Leu Trp Phe Asp Ala Tyr Ala Leu Arg Arg Gin Lys Tyr 
225 230 235 240 

Asp Leu Pro Cys Thr Gin Gly Gin Trp Thr He Gin Asn Arg Arg Leu 

245 250 255 

Cye Gly Leu Arg He Leu Gin Pro Tyr Ala Glu Arg He Pro Val Val 

260 265 270 

Ala Thr Ala Gly He Thr He Asn Phe Thr Ser Gin He Ser Leu Thr 

275 280 285 

Gly Pro Gly Val Arg Val His Tyr Gly Leu Tyr Asn Gin Ser Asp Pro 

290 295 300 

Cys Pro Gly Glu Phe Leu Cys Ser Val Asn Gly Leu Cys Val Pro Ala 
305 310 315 320 

Cys Asp Gly Val Lys Asp Cys Pro Asn Gly Leu Asp Glu Arg Asn Cys 

325 330 335 

Val Cys Arg Ala Thr Phe Gin Cys Lys Glu Asp Ser Thr Cys He Ser 

340 345 350 

Leu Pro Lys Val Cys Asp Gly Gin Pro Asp Cys Leu Asn Gly Ser Asp 

355 360 365 

Glu Glu Gin Cys Gin Glu Gly Val Pro Cys Gly Thr Phe Thr Phe Gin 

370 375 380 

Cys Glu Asp Arg Ser Cys Val Lys Lys Pro Asn Pro Gin Cys Asp Gly 
385 390 395 400 

Arg Pro Asp Cys Arg Asp Gly Ser Asp Glu Glu His Cys Glu Cys Gly 

405 410 415 

Leu Gin Gly Pro Ser Ser Arg He Val Gly Gly Ala Val Ser Ser Glu 

420 425 430 

Gly Glu Trp Pro Trp Gin Ala Ser Leu Gin Val Arg Gly Arg His He 

435 440 445 

Cys Gly Gly Ala Leu He Ala Asp Tirg Trp Val He Thr Ala Ala His 
450 455 460 
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\JyB File 


Gin Glu Asp 


Ser 


Met Ala Ser 


Thr 


Val 


Leu 


Trp 


Thr 


Val 


Phe 


4 w 3 






470 






475 










480 


ijcU tjrJ.y 


Lys Val 


Trp 


Gin 


Asn Ser Ttrg 


Trp 


Pro 


Gly 


Glu 


Val 


Ser 


Phe 


485 






490 










495 


His 


ijys vax 


Ser Arg 


Leu 


Leu 


Leu His Pro 


Tyr 


His 


Glu 


Glu Asp 


Ser 


500 






505 










510 








Asp Val 


Ala 


Leu 


Leu Gin Leu 


Asp 


His 


Pro 


Val 


Val 


Arg 


Ser 


515 






520 








525 








Ala Ala 


Val Arg 


Pro 


Val 


Cys Leu Pro 


Ala 


Arg 


Ser 


His . 


.Phe 


Phe 


Glu 


530 






535 






540 










Pro Gly 


Leu His 


Cys 


Trp 


lie Thr Gly Trp Gly Ala 


Leu 


Arg 


Glu Gly 


545 




550 






555 










560 


Gly Pro 


lie Ser 


Asn 


Ala 


Leu Gin Lys 


Val 


Asp 


Val 


Gin 


Leu 


He 


Pro 




565 






570 










575 




Glxi Asp 


Leu Cys 


Ser 


Glu 


Val Tyr Arg 


Tyr 


Gin 


Val 


Thr 


Pro 


Arg 


Met 


580 






5B5 










590 






Leu Cys 


Ala Gly Tyr Arg 


Lys Gly Lys 


Lys 


Asp 


Ala 


Cys 


Gin 


Gly Asp 


595 






600 








605 








Ser Gly 


Gly Pro 


Leu 


Val 


Cys Lys Ala 


Leu 


Ser 


Gly 


Arg 


Trp 


Phe 


Leu 


610 






615 






620 








Phe 


Ala Gly 


Leu Val 


Ser 


Trp 


Gly Leu Gly Cys 


Gly Arg 


Pro 


Asn 


Tyr 


625 






630 






635 










640 


Gly Val 


Tyr Thr 


Arg 


lie 


Thr Gly Val 


lie 


Ser 


Trp 


lie 


Gin 


Gin 


Val 


645 






650 










655 





Val Thr 



<210> 11 

<211> 1656 

<212> DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> (268) . . . (1629) 

<223> Nucleic acid encoding a transmembrane serine 
protease (MTSP-6) protein 

<400> 11 

cgcccgggca ggtcagtaac actgtggcct actatctctt ccgtggtgcc atctacattt 60 

ttgggactcg ggaattatga ctgtttttgg ttaatcgata ctgaatgcgc tttgtgtgga 12 0 

ctgtcgaatt tcaaagattt accgtatgac caagatgcac ctgatgctac aagtataaat 180 

aggggaacaa atgctttctg ttcttcctcg gctaaggagg tagaggtgga ggcggagccg 240 

gatgtcagag gtcctgaaat agtcacc atg ggg gaa aat gat ccg cct get gtt 294 

Met Gly Glu Asn Asp Pro Pro Ala Val 
1 5 



gaa gcc ccc ttc tea ttc cga teg ctt ttt ggc ctt gat gat ttg aaa 

Glu Ala Pro Phe Ser Phe Arg Ser Leu Phe Gly Leu Asp Asp Leu Lye 

10 15 20 25 

ata agt cct gtt gca cca gat gca gat get gtt get gea cag ate ctg 

He Ser Pro Val Ala Pro Asp Ala Asp Ala Val. Ala Ala Gin He Leu 

30 35 40 

tea ctg ctg cca ttg aag ttt ttt cca ate ate gtc att ggg ate att 

Ser Leu Leu Pro Leu Lys Phe Phe Pro He He Val He Gly He He 
45 50 55 

gca ttg ata tta gca ctg gcc att ggt etg ggc ate cac ttc gae tgc 

Ala Leu He Leu Ala Leu Ala He Gly Leu Gly He His Phe Asp Cys 



342 



390 



438 



486 
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60 65 70 

tea ggg aag tac aga tgt cgc tea tec ttt aag tgt ate gag etg ata 534 

Ser Gly Lys Tyr Arg Cys Arg Ser Ser Phe Lys Cys lie Glu Leu lie 
75 80 85 

get cga tgt gae gga gtc teg gat tge aaa gac ggg gag gac gag tac 582 

Ala Aarg Cys Asp Gly Val Ser Asp Cys Lys Asp Gly Glu Asp Glu Tyr 
90 95 100 105 



cgc tgt gtc egg gtg ggt ggt cag aat gee gtg etc cag gtg ttc aca 
Arg Cys Val Arg Val Gly Gly Gin Asn Ala Val Leu Gin Val Phe Thr 

110 115 120 

get get teg tgg aag aec atg tgc tec gat gae tgg aag ggt cac tac 
Ala Ala Ser Trp Lys Thr Met Cys Ser Asp Asp Trp Lys Gly His Tyr 
125 130 135 

gca aat gtt gee tgt gee caa etg ggt ttc eca age tat gta agt tea 
Ala Asn Val Ala Cys Ala Gin Leu Gly Phe Pro Ser Tyr Val Ser Ser 
140 145 150 



gcc cea tec cac ttg gtg gag aag att gtc tac cac age aag tac aag 
Ala Pro Ser His Leu Val Glu Lys 'He Val Tyr His Ser Lys Tyr Lys 
2B5 290 295 

cca aag agg etg ggc aat gae ate gcc ctt atg aag etg gcc ggg cea 
Pro Lys Arg Leu Gly Asn Asp He Ala Leu Met Lye Leu Ala Gly Pro 
300 305 310 



630 



678 



726 



gat aac etc aga gtg age teg eta gag ggg cag ttc egg gag gag ttt 774 
Asp Asn Leu Arg Val Ser Ser Leu Glu Gly Gin Phe Arg Glu Glu Phe 
155 160 165 



gtg tee ate gat cac etc ttg cea gat gac aag gtg act gca tta cac 822 
Val Ser He Asp His Leu Leu Pro Asp Asp Lys Val Thr Ala Leu His 
170 175 180 185 

cae tea gta tat gtg agg gag gga tgt gcc tct ggc cac gtg gtt ace 
His Ser Val Tyr Val Arg Glu Gly Cys Ala Ser Gly His Val Val Thr 

190 195 200 

ttg cag tge aca gcc tgt ggt cat aga agg ggc tac age tea cgc ate 
Leu Gin Cys Thr Ala Cys Gly His Arg Arg Gly Tyr Ser Ser Arg He 
205 210 215 

gtg ggt gga aac atg tec ttg etc teg cag tgg ccc tgg cag gcc age 
Val Gly Gly Asn Met Ser Leu Leu Ser Gin Trp Pro Trp Gin Ala Ser 
220 225 230 

ett cag ttc cag ggc tac cac etg tgc ggg ggc tet gtc ate acg ccc 
Leu Gin Phe Gin Gly Tyr His Leu Cys Gly Gly Ser Val He Thr Pro 
235 240 245 

etg tgg ate ate act get gca cae tgt gtt tat gae ttg tac etc ccc 
Leu Trp He He Thr Ala Ala His Cys Val Tyr Asp Leu Tyr Leu Pro 
250 255 260 265 

aag tea tgg ace ate cag gtg ggt eta gtt tec etg ttg gac aat cca 1110 
Lye Ser Trp Thr He Gin Val Gly Leu Val Ser Leu Leu Asp Asn Pro 

270 275 280 



870 



918 



966 



1014 



1062 



1158 



1206 
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ctc acg ttc aat gaa atg ate cag cct gtg tgc ctg ccc aac tct gaa 1254 
Leu Thr Phe Asn Glu Met lie Gin Pro Val Cys Leu Pro Asn Ser Glu 
315 320 325 

gag aac ttc ccc gat gga aaa gtg tgc tgg acg tea gga tgg ggg gcc 13 02 

Glu Asn Phe Pro Asp Gly Lye Val Cys Trp Thr Ser Gly Trp Gly Ala 
330 335 340 345 

aca gag gat gga ggt gac gcc tec cct gtc ctg aac cac gcg gcc gtc 1350 
Thr Glu Asp Gly Gly Asp Ala Ser Pro Val Leu Asn His Ala Ala Val 

350 355 360 

cct ttg att tec aac aag ate tgc aac cac agg gac gtg tac ggt gge 1398 
Pro Leu lie Ser Asn Lys lie Cys Asn His Arg Asp Val Tyr Gly Gly 
365 370 375 

ate ate tec ccc tec atg etc tgc gcg ggc tac ctg acg ggt ggc gtg 1446 
He He Ser Pro Ser Met Leu Cys Ala Gly Tyr Leu Thr Gly Gly Val 
380 385 390 

gac age tgc cag ggg gac age ggg ggg ccc ctg gtg tgt caa gag agg 1494 
Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys Gin Glu Arg 
395 400 405 

agg ctg tgg aag tta gtg gga gcg acc age ttt ggc ate ggc tgc gca 1542 
Arg Leu Trp Lys Leu Val Gly Ala Thr Ser Phe Gly He Gly Cys Ala 
410 415 420 425 



gag gtg aac aag cct ggg gtg tac acc cgt gtc acc tec ttc ctg gac 
Glu Val Asn Lys Pro Gly Val Tyr Thr Arg Val Thr Ser Phe Leu Asp 

430 435 440 

tgg ate cac gag cag atg gag aga gac eta aaa acc tga agaggaaggg 
Trp He His Glu Gin Met Glu Arg Asp Leu Lys Thr * 
445 450 

gataagtagc cacctga 

<210> 12 

<211> 453 

<212> PRT 

<213> Homo Sapien 

<400> 12 , ^ « , 

Met Gly Glu Asn Asp Pro Pro Ala Val Glu Ala Pro Phe Ser Phe Arg 

1 5 10 15 

Ser Leu Phe Gly Leu Asp Asp Leu Lys He Ser Pro Val Ala Pro Asp 

20 25 30 

Ala Asp Ala Val Ala Ala Gin He Leu Ser Leu Leu Pro Leu Lys Phe 

35 40 45 

Phe Pro He He Val He Gly He He Ala Leu He Leu Ala Leu Ala 

50 55 60 

He Gly Leu Gly He His Phe Asp Cys Ser Gly Lys Tyr Arg Cys Arg 
65 70 75 80 

Ser Ser Phe Lys Cys He Glu Leu He Ala Arg Cys Asp Gly Val Ser 

85 90 95 

Asp Cys Lys Asp Gly Glu Asp Glu Tyr Arg Cys Val Arg Val Gly Gly 

100 105 110 

Gin Asn Ala Val Leu Gin Val Phe Thr Ala Ala Ser Trp Lys Thr Met 
115 120 125 



1590 



1639 



1656 
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Cys Ser Asp Asp Trp Lys Gly Hie Tyr Ala Asn Val Ala Cys Ala Gin 

130 135 140 

Leu Gly Phe Pro Ser Tyr Val Ser Ser Asp Asn Leu Arg Val Ser Ser 
145 150 155 160 

Leu Glu Gly Gin Phe Arg Glu Glu Phe Val Ser lie Asp Hie Leu Leu 

165 170 175 

Pro Asp Asp Lys Val Thr Ala Leu His His Ser Val Tyr Val Arg Glu 

180 185 190 

Gly Cys Ala Ser Gly His Val Val Thr Leu Gin Cys Thr Ala Cys Gly 

195 200 205 

His Arg Arg Gly Tyr Ser Ser Arg lie Val Gly Gly Asn Met Ser Leu 

210 215 220 

Leu Ser Gin Trp Pro Trp Gin Ala Ser Leu Gin Phe Gin Gly Tyr His 
225 230 235 240 

Leu Cys Gly Gly Ser Val lie Thr Pro Leu Trp He He Thr Ala Ala 

245 250 255 

His Cys Val Tyr Asp Leu Tyr Leu Pro Lys Ser Trp Thr He Gin Val 

260 265 270 

Gly Leu Val Ser Leu Leu Asp Asn Pro Ala Pro Ser His Leu Val Glu 

275 280 285 

Lys He Val Tyr His Ser Lys Tyr Lys Pro Lys Arg Leu Gly Asn Asp 

290 295 300 

He Ala Leu Met Lys Leu Ala Gly Pro Leu Thr Phe Asn Glu Met He 
305 310 315 320 

Gin Pro Val Cys Leu Pro Asn Ser Glu Glu Asn Phe Pro Asp Gly Lys 

325 330 335 

Val Cys Trp Thr Ser Gly Trp Gly Ala Thr Glu Asp Gly Gly Asp Ala 

340 345 350 

Ser Pro Val Leu Asn His Ala Ala Val Pro Leu He Ser Asn Lys He 

355 360 365 

Cys Asn His Arg Asp Val Tyr Gly Gly He He Ser Pro Ser Met Leu 

370 375 380 

Cvs Ala Gly Tyr Leu Thr Gly Gly Val Asp Ser Cys Gin Gly Asp Ser 
385 390 395 400 

Gly Gly Pro Leu Val Cys Gin Glu Arg Arg Leu Trp Lys Leu Val Gly 

405 410 415 

Ala Thr Ser Phe Gly He Gly Cye Ala Glu Val Asn Lys Pro Gly Val 

420 425 430 

Tyr Thr Arg Val Thr Ser Phe Leu Asp Trp He His Glu Gin Met Glu 

435 440 445 

Arg Asp Leu Lys Thr 
450 

<210> 13 
<211> 2100 
<212> DNA 
<213> Homo sapien 

<220> 

<221> CDS 

<222> (45) . . . (1361) 

<223> Nucleic acid encoding MTSP7 

<400> 13 

aoatcaQatQ gcgactgaat agaagctgcc ccagtcctgg gttc atg atg tac aca 56 

^ 3 3 3 => j^g^ ,py^ rjj^ 

1 



cct gtt gaa ttt tea gaa get gaa ttc tea cga get gaa tat caa aga 
Pro Val Glu Phe Ser Glu Ala Glu Phe Ser Arg Ala Glu Tyr Gin Arg 
5 10 15 20 



104 
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aag cag caa ttt tgg gac tea gta egg eta get ctt ttc aca tta gca 152 
Lys Gin Gin Phe Trp Asp Ser Val Arg Leu Ala Leu Phe Thr Leu Ala 

25 30 35 



att gta gea ate ata gga att gca att ggt att gtt act cat ttt gtt 

lie Val Ala He He Gly He Ala He Gly He Val Thr His Phe Val 

40 45 50 

gtt gag gat gat aag tct ttc tat tac ctt gee tot ttt aaa gtc aca 

Val Glu Asp Asp Lys Ser Phe Tyr Tyr Leu Ala Ser Phe Lys Val Thr 

55 60 65 

aat ate aaa tat aaa gaa aat tat gge ata aga tct tea aga gag ttt 

TVsn He Lys Tyr Lys Glu Asn Tyr Gly He Arg Ser Ser Arg Glu Phe 

70 75 80 



aaa cca tea ttt aga etc aca ect att gac age aaa aag atg agg aat 
Lvs Pro Ser Phe Arg Leu Thr Pro He Asp Ser Lys Lys Met Arg Asn 
^ 170 175 180 



165 



aca gca get eac tgc ttt tgg aaa aat aaa gae eca act caa tgg att 
Thr Ala Ala His Cys Phe Trp Lys Asn Lys Asp Pro Thr Gin Trp He 
245 250 255 260 



200 



248 



296 



ata gaa agg agt cat cag att gaa aga atg atg tct agg ata ttt cga 344 
He Glu Arg Ser His Gin He Glu Arg Met Met Ser Arg He Phe Arg 
B5 90 95 100 

cat tct tct gta ggc ggt cga ttt ate aaa tct cat gtt ate aaa tta 
His Ser Ser Val Gly Gly Arg Phe He Lys Ser His Val He Lys Leu 

105 110 115 

agt cea gat gaa caa ggt gtg gat att ctt ata gtg etc ata ttt cga 
Ser Pro Asp Glu Gin Gly Val Asp He Leu He Val Leu He Phe Arg 
120 125 130 

tac cca tct act gat agt get gaa caa ate aag aaa aaa att gaa aag 
Tyr Pro Ser Thr Asp Ser Ala Glu Gin He Lys Lys Lys He Glu Lys 
135 140 145 

get tta tat eaa agt ttg aag acc aaa caa ttg tct ttg acc ata aac 536 
Ala Leu Tyr Gin Ser Leu Lys Thr Lys Gin Leu Ser Leu Thr He Asn 
ISO 155 160 



392 



440 



488 



584 



Ctt etc aac agt cgc tgt gga ata agg atg aca tct tea aac atg cca 632 
Leu Leu Asn Ser Arg Cys Gly He Arg Met Thr Ser Ser Asn Met Pro 

185 190 195 

tta cca gca tec tct tct act caa aga att gtc caa gga agg gaa aca 680 
Leu Pro Ala Ser Ser Ser Thr Gin Arg He Val Gin Gly Arg Glu Thr 
200 205 210 

get atg gaa ggg gaa tgg cca tgg cag gee age etc cag etc ata ggg 728 
Ala Met Glu Gly Glu Trp Pro Trp Gin Ala Ser Leu Gin Leu He Gly 
215 220 225 

tea ggc cat cag tgt gga gee age etc ate agt aac aca tgg ctg etc 776 
Ser Gly His Gin Cys Gly Ala Ser Leu He Ser Asn Thr Trp Leu Leu 
230 235 240 



824 
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get act ttt ggt gca act ata aca cca ccc gca gtg aaa cga aat gtg 872 
Ala Thr Phe Gly Ala Thr lie Thr Pro Pro Ala Val Lys Arg Asn Val 

265 270 275 

agg aaa att att ctt cat gag aat tac cat aga gaa aca aat gaa aat 920 
Arg Lys lie lie Leu His Glu Asn Tyr His Arg Glu Thr Asn Glu Asn 
280 2B5 290 

gac att get ttg gtt cag etc tet act gga gtt gag ttt- tea aat ata 968 
Asp lie Ala Leu Val Gin Leu Ser Thr Gly Val Glu Phe Ser Asn lie 
295 300 305 

gtc cag aga gtt tgc etc cca gac tea tct ata aag ttg cca ect aaa 1016 
Val Gin Arg Val Cys Leu Pro Asp Ser Ser lie Lys Leu Pro Pro Lys 
310 315 320 

aca agt gtg ttc gtc aca gga ttt gga tec att gta gat gat gga ect 1064 
Thx Ser Val Phe Val Thr Gly Phe Gly Ser lie Val Asp Asp Gly Pro 
325 330 335 340 

ata caa aat aca ctt egg caa gee aga gtg gaa acc ata age act gat 1112 
lie Gin Asn Thr Leu Arg Gin Ala Arg Val Glu Thr lie Ser Thr Asp 

345 350 355 

gtg tgt aac aga aag gat gtg tat gat ggc ctg ata act cca gga atg 1160 
Val Cys Asn Arg Lys Asp Val Tyr Asp Gly Leu lie Thr Pro Gly Met 
360 365 370 

tta tgt get gga ttc atg gaa gga aaa ata gat gca tgt aag gga gat 1208 
Leu Cys Ala Gly Phe Met Glu Gly Lys He Asp Ala Cys Lys Gly Asp 
375 380 365 

tct ggt gga ect ctg gtt tat gat aat cat gac ate tgg tac att gta 1256 
Ser Gly Gly Pro Leu Val Tyr Asp Asn His Asp He Trp Tyr He Val 
390 395 400 

ggt ata gta agt tgg gga caa tea tgt gca ctt cec aaa aaa ect gga 13 04 
Gly He Val Ser Trp Gly Gin Ser Cys Ala Leu Pro Lye Lys Pro Gly 
405 410 415 420 

gtc tac acc aga gta act aag tat cga gat tgg att gee tea aag act 1352 
Val Tyr Thr Arg Val Thr Lys Tyr Arg Asp Trp He Ala Ser Lys Thr 

425 430 435 

ggt atg tag tgtggattgt ccatgagtta tacacatggc acacagagct 1401 
Gly Met * 

gataetcctg cgtattttgt attgtttaaa ttcatttact ttggattagt gcttttgcta 1461 

gatgteaaga agccctteag acccagacaa atctaatatc ctgaggtggc ctttacatac 1521 

gtaggaccaa accctetcta ccatgaggga agaagacaca gcaaatgaca gacagcacct 1581 

attccttact eacaagggaa aetgcttgtg atacttccta ataagataaa taagtggttt 1641 

ccctcaattg aagacaggaa catcattttc cacaggatat gaagagctgc cagtaatgcc 1701 

aaaatcttac ctcatataat acctggagca tgtgagattc ttctagtgaa aaagaaeagt 1761 

cttcectgaa gactcagggc ttcaacattc tagaactgat aagtggacct tcagtgtgca 1821 

agaatggaga agcatgggat ttgcattatg acttgaactg ggc t tat ate taataataca 1881 

gagcaetatc actaacctca acagttgaca ttttaaaagt ttttaaatgt atctgaactt 1941 

gctgttaaca cagtgttata actcaagcac tagettcagg aagcatgttg tgt tgt t aag 2001 

aagcttttct gatttattet ttaacagcat cttgceatct atatgttagt agcagttggc 2061 

ccagaaagga caaaaaaaaa aaaaaaaaaa aaaaaaaaa 2100 
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<210> 14 

<211> 438 

<212> PRT 

<213> Homo sapien 



<400> 14 



Met 


Met 


Tvr 


Thr 


Pro 


Val 


Glu 


Phe 


Ser 


Glu 


7U.a 


Glu 


Phe 


Ser 


Arq 


Ala 


1 






5 










10 










15 




Glu 


Tyr 


Gin 


Arg 


Lys 


Gin 


Gin 


Phe 


Trp 


Asp 


Ser 


Val 


Arg 


Leu 


Ala 


Leu 






20 










25 










30 






Phe 


Thr 


Leu 
35 


Ala 


He 


Val 


Ala 


He 
40 


He 


Gly 


He 


Ala 


He 
45 


Gly 


He 


Val 


Thr 


His 


Phe 


Val 


Val 


Glu Asp Asp 


Lys 


Ser 


Phe 


Tyr 


Tyr 


Leu 


Ala 


Ser 




50 










55 










60 














Val 


Thr 


Asn 


He 


Lys 


Tyr 


Lys 


Glu Asn Tyr Gly 


He 


Atq 


Ser 










70 










75 










80 






Glu 


Phe 


He 


Glu Arg 


Ser 


His 


Gin 


He 


Glu 


Arg 


Met 


Met 


Ser 








85 










90 










95 






He 


Phe 




His 


Ser 


Ser 


Val 


Gly Gly Arg 


Phe 


He 


Lvs 


Ser 


His 






100 










105 










110 






Val 


He 


Lys 


Leu 


Ser 


Pro 


Asp 


Glu 


Gin Gly Val Asp 


He 


Leu 


He 


Val 






115 










120 










125 








Leu 


He 
130 


Phe 


Arcr 


Tyr 


Pro 


Ser 
135 


Thr 


Asp 


Ser 


Ala 


Glu 
140 


Gin 


He 


Lys 


Lys 




He 


Glu 


Lys 


Ala 


Leu 


Tyr 


Gin 


Ser 


Leu 


Lys 


Thr 


Lys 


Gin 


Leu 


Ser 


145 










150 










155 










160 






He 


Asn 


Lys 
165 


Pro 


Ser 


Phe 


Arg 


Leu 
170 


Thr 


Pro 


He 


Asp 


Ser 
175 


Lys 










Leu 


Leu 


Asn 


Ser 


Arg 


Cys 


Gly 


He 


Arg 


Met 


Thr 


Ser 




180 










185 










190 








7!k a ri 


Met 




Leu 


Pro 


Ala 


Ser 


Ser Ser Thr Gin Arg 


He 


Val 


Gin 






195 










200 










205 












Glu 

w JU 


TViT" 


Ala 


Met 


Glu Gly 


Glu 


Trp 


Pro 


Trp 


Gin 


Ala 


Ser 


Leu 


210 










215 










220 












±JC LI 






Ser Gly His 


Gin 


Cys 


Gly Ala 


Ser 


Leu 


He 


Ser 


Asn 


225 








230 










235 










240 




Tim 




Leu 


Thr 


Ala 


Ala 


His 


Cys 


Phe 


Trp 


Lys 


Asn 


Lys 


Asp 


Pro 








245 










250 










255 




Thr 


Gin 


Trp 


He 


Ala 


Thr 


Phe 


Gly 


Ala 


Thr 


He 


Thr 


Pro 


Pro 


Ala 


Val 






260 










265 










270 






Lys 


Arg 


Asn 


Val 


Arg 


Lys 


He 


He 


Leu 


His 


Glu Asn Tyr 


His 


Arg 


Glu 


275 










280 










285 








Thr 


Asn 


Glu 


Asn 


Asp 


He 


Ala 


Leu 


Val 


Gin 


Leu 


Ser 


Thr 


Gly Val 


Glu 




290 








295 










300 










Phe 


Ser 


Asn 


He 


Val 


Gin Arg Val 


Cys 


Leu 


Pro 


Asp 


Ser 


Ser 


He 


Lys 


305 










310 










315 










320 


lieu 


Pro 


Pro 


Lys 


Thr 


Ser 


Val 


Phe 


Val 


Thr 


Gly 


Phe 


Gly 


Ser 


He 


Val 








325 










330 










335 




Asp 


Asp 


Gly 


Pro 


He 


Gin 


Asn 


Thr 


Leu 


Arg 


Gin 


Ala 


Arg 


Val 


Glu 


Thr 


340 










345 










350 






He 


Ser 


Thr 


Asp 


Val 


Cys 


Asn Arg 


Lys 


Asp 


Val 


Tyr 


Asp 


Gly 


Leu 


He 






355 










360 










365 








Thr 


Pro 


Gly 


Met 


Leu 


Cys 


Ala 


Gly 


Phe 


Met 


Glu 


Gly Lys 


He 


Asp 


Ala 




370 








375 










380 










Cys 


Lys 


Gly 


Asp 


Ser Gly 


Gly 


Pro 


Leu 


Val 


Tyr 


Asp 


Asn 


His 


Asp 


He 


3B5 










390 










395 










400 


Trp 


Tyr 


He 


Val 


Gly 


He 


Val 


Ser 


Trp Gly Gin 


Ser 


Cys 


Ala 


Leu 


Pro 






405 










410 










415 




Lys 


Lys 


Pro 


Gly 


Val 


Tyr 


Thr Arg 


Val 


Thr 


Lys 


Tyr Arg 


Asp 


Trp 


He 




420 










425 










430 






Ala 


Ser 


Lys 


Thr 


Gly Met 
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435 

<210> 15 

<211> 702 

<212> DNA 

<213> Homo sapien 

<220> 

<221> CDS 

<222> (1) . - . (702) 

<223> Nucleotide sequence encoding MTSP-7 Protease Domain 
<400> 15 

att gtc caa gga agg gaa aca get atg gaa ggg gaa tgg cca tgg cag 
lie Val Gin Gly Arg Glu Thr Ala Met Glu Gly Glu Trp Pro Trp Gin 
15 10 15 

gcc age etc cag etc at a ggg tea ggc cat eag tgt gga gee age etc 
Ala Ser Leu Gin Leu lie Gly Ser Gly His Gin Cys Gly Ala Ser Leu 
20 25 30 



aaa gac cca act caa tgg att get act ttt ggt gea act ata aea cca 
Lys Asp Pro Thr Gin Trp He Ala Thr Phe Gly Ala Thr He Thr Pro 
50 55 60 

ccc gca gtg aaa cga aat gtg agg aaa att att ctt cat gag aat tac 
Pro Ala Val Lys Arg Asn Val Arg Lys He He Leu His Glu Asn Tyr 
65 70 75 80 

cat aga gaa aca aat gaa aat gac att get ttg gtt cag etc tct act 
His Arg Glu Thr Asn Glu Asn Asp He Ala Leu Val Gin Leu Ser Thr 

85 90 95 



48 



96 



ate agt aac aca tgg ctg etc aca gca get cac tgc ttt tgg aaa aat 144 
He Ser Asn Thr Trp Leu Leu Thr Ala Ala His Cys Phe Trp Lys Asn 
35 40 45 



192 



240 



288 



gga gtt gag ttt tea aat ata gtc eag aga gtt tgc etc cca gac tea 336 

Gly Val Glu Phe Ser Asn He Val Gin Arg Val Cys Leu Pro Asp Ser 

100 105 110 

tct ata aag ttg cca cct aaa aca agt gtg ttc gtc aca gga ttt gga 3 84 

Ser He Lys Leu Pro Pro Lys Thr Ser Val Phe Val Thr Gly Phe Gly 
115 120 125 

tec att gta gat gat gga cct ata caa aat aca ctt egg caa gcc aga 432 

Ser He Val Asp Asp Gly Pro He Gin Asn Thr Leu Arg Gin Ala Arg 
130 135 140 

gtg gaa acc ata age act gat gtg tgt aac aga aag gat gtg tat gat 

Val Glu Thr He Ser Thr Asp Val Cys Asn Arg Lys Asp Val Tyr Asp 
145 150 155 160 

ggc ctg ata act cca gga atg tta tgt get gga ttc atg gaa gga aaa 

Gly Leu He Thr Pro Gly Met Leu Cys Ala Gly Phe Met Glu Gly Lys 

165 170 175 

ata gat gca tgt aag gga gat tct ggt gga cct ctg gtt tat gat aat 576 

He Asp Ala Cys Lys Gly Asp Ser Gly Gly Pro Leu Val Tyr Asp Asn 

180 185 190 



480 



528 
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cat gac ate tgg tac att gta ggt ata gta agt tgg gga caa tea tgt 624 
HiB ABp lie Trp Tyr lie Val Gly lie Val Ser Trp Gly Gin Ser Cys 
195 200 205 

gca ctt ccc aaa aaa cct gga gtc tac acc aga gta act aag tat cga 672 
Ala Leu Pro Lys Lys Pro Gly Val Tyr Thr Arg Val Thr Lys Tyr Arg 
210 215 220 

gat tgg att gcc tea aag act ggt atg tag 702 
Asp Trp lie Ala Ser Lys Thr Gly Met * 
225 230 

<210> 16 

<211> 233 

<212> PRT 

<213> Homo sapien 

<400> 16 

He Val Gin Gly Arg Glu Thr Ala Met Glu Gly Glu Trp Pro Trp Gin 

15 10 15 

Ala Ser Leu Gin Leu He Gly Ser Gly His Gin Cys Gly Ala Ser Leu 

20 25 30 

He Ser Asn Thr Trp Leu Leu Thr Ala Ala His Cys Phe Trp Lys Asn 

35 40 45 

Lys Asp Pro Thr Gin Trp He Ala Thr Phe Gly Ala Thr He Thr Pro 

50 55 60 

Pro Ala Val Lys Arg Asn Val Arg Lys He He Leu His Glu Asn Tyr 
65 70 75 80 

His Arg Glu Thr Asn Glu Asn Asp He Ala Leu Val Gin Leu Ser Thr 

- 85 90 95 

Gly Val Glu Phe Ser Asn He Val Gin Arg Val Cys Leu Pro Asp Ser 

100 105 110 

Ser He Lys Leu Pro Pro Lys Thr Ser Val Phe Val Thr Gly Phe Gly 

115 120 125 

Ser He Val Asp Asp Gly Pro He Gin Asn Thr Leu Arg Gin Ala Arg 

130 135 140 

Val Glu Thr He Ser Thr Asp Val Cys Asn Arg Lys Asp Val Tyr Asp 
145 150 155 160 

Gly Leu He Thr Pro Gly Met Leu Cys Ala Gly Phe Met Glu Gly Lys 

165 170 175 

He Asp Ala Cys Lys Gly Asp Ser Gly Gly Pro Leu Val Tyr Asp Asn 

180 185 190 

His Asp He Trp Tyr He Val Gly He Val Ser Trp Gly Gin Ser Cys 

195 200 205 

/aa Leu Pro Lys Lys Pro Gly Val Tyr Thr Arg Val Thr Lys Tyr Arg 

210 215 220 

Asp Trp He Ala Ser Lys Thr Gly Met 
225 230 

<210> 17 

<211> 777 

<212> DNA 

<213> Homo Sapien 

<220> 

<221> CDS 

<222> {1)...(729) 

<223> Nucleotide sequence encoding MTSP9, including 
protease domain (31-729) 
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<400> 17 

aaa cga gtt gtt cca tta aac gtc aac aga ata gca tct gga gtc att 48 
Lys Arg Val Val Pro Leu Asn Val Asn Arg lie Ala Ser Gly Val He 
15 10 15 

gca ccc aag gcg gcc tgg cct tgg caa get tec ctt cag tat gat aac 96 
Ala Pro Lys Ala Ala Txp Pro Trp Gin Ala Ser Leu Gin Tyr Asp Asn 
20 25 30 

ate cat cag tgt ggg gcc acc ttg att agt aac aca tgg ctt gtc act 144 
He His Gin Cys Gly Ala Thr Leu He Ser Asn Thr Trp Leu Val Thr 
35 40 45 

gca gca cac tgc ttc cag aag tat aaa aat cca cat caa tgg act gtt 192 
Ala Ala His Cys Phe Gin Lys Tyr Lys Asn Pro His Gin Trp Thr Val 
50 55 60 

agt ttt gga aca aaa ate aac cct ccc tta atg aaa aga aat gtc aga 240 
Ser Phe Gly Thr Lys He Asn Pro Pro Leu Met Lys Arg Asn Val Arg 
65 70 75 80 

aga ttt att ate cat gag aag tac cgc tct gca gca aga gag tac gac 288 
Arg Phe He He HiB Glu Lys Tyr Arg Ser Ala Ala Arg Glu Tyr Asp 

85 90 95 

att get gtt gtg cag gtc tct tec aga gtc acc ttt teg gat gac ata 336 
He Ala Val Val Gin Val Ser Ser Arg Val Thr Phe Ser Asp Asp He 
100 105 110 

CQC egg att tgt ttg cca gaa gcc tct gca tec ttc caa cca aat ttg 
Arg Arg He Cys Leu Pro Glu Ala Ser Ala Ser Phe Gin Pro Asn Leu 
115 120 125 



145 



150 155 160 



ggg gga cct tta gtc aca agg gat ctg aaa gat acg tgg tat etc att 
Gly Gly Pro Leu Val Thr Arg Asp Leu Lys Asp Thr Trp Tyr Leu He 
195 200 205 

gga att gta age tgg gga gat aac tgt ggt caa aag gac aag cct gga 
Gly He Val Ser Trp Gly Asp Asn Cys Gly Gin Lys Asp Lys Pro Gly 
210 215 220 

gtc tac aca caa gtg act tat tac cga aac tgg att get tea aaa aca 
Val Tyr Thr Gin Val Thr Tyr Tyr Arg Asn Trp He Ala Ser Lys Thr 
225 230 235 240 



384 



act gtc cac ate aca gga ttt gga gca ctt tac tat ggt ggg gaa tec 432 
Thr Val His He Thr Gly Phe Gly Ala Leu Tyr Tyr Gly Gly Glu Ser 
130 135 140 

caa aat gat etc cga gaa gcc aga gtg aaa ate ata agt gac gat gtc 
Gin Asn Asp Leu Arg Glu Ala Arg Val Lys He He Ser Asp Asp Val 



480 



52 8 



tgc aag caa cca cag gtg tat ggc aat gat ata aaa cet gga atg ttc 
Cys Lys Gin Pro Gin Val Tyr Gly Asn Asp He Lys Pro Gly Met Phe 

165 170 175 

tqt gcc gga tat atg gaa gga att tat gat gee tgc agg ggt gat tct 576 
CVB Ala Gly Tyr Met Glu Gly He Tyr Asp Ala Cys Arg Gly Asp Ser 
180 IBS 190 



624 



672 



720 
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ggc ate taa ttcacgataa aagttaaaca aagaaagctg tatgcaggtc atatatgc 777 
Gly lie 

<210> 18 

<211> 242 

<212> PRT 

<213> Homo Sapien 

<220> 

<221> SITE 

<222> (11) . , - (242) 

<223> MTSP9 protease domain 

<400> 18 

Lys Arg Val Val Pro Leu Asn Val Asn Arg lie Ala Ser Gly Val lie 
1 5 10 X5 

Ala Pro Lys Ala Ala Trp Pro Trp Gin Ala Ser Leu Gin Tyr Asp Asn 
20 25 30 

He His Gin Cys Gly Ala Thr Leu He Ser Asn Thr Trp Leu Val Thr 
35 40 45 

Ala Ala His Cys Phe Gin Lys Tyr Lys Asn Pro His Gin Trp Thr Val 
50 55 60 

Ser Phe Gly Thr Lys He Asn Pro Pro Leu Met Lys Arg Asn Val Arg 
65 70 75 80 

Arg Phe He He His Glu Lys Tyr Arg Ser Ala Ala Arg Glu Tyr Asp 

85 90 95 

He Ala Val Val Gin Val Ser Ser Arg Val Thr Phe Ser Asp Asp He 
100 105 110 

Arg Arg He Cys Leu Pro Glu Ala Ser Ala Ser Phe Gin Pro Asn Leu 
115 120 125 

Thr Val His He Thr Gly Phe Gly Ala Leu Tyr Tyr Gly Gly Glu Ser 
130 135 140 

Gin Asn Asp Leu Arg Glu Ala Arg Val Lys He He Ser Asp Asp Val 
145 150 155 160 

CVS Lys Gin Pro Gin Val Tyr Gly Asn Asp He Lys Pro Gly Met Phe 
^ 165 170 175 

Cys Ala Gly Tyr Met Glu Gly He Tyr Asp Ala Cys Arg Gly Asp Ser 
180 185 190 

Gly Gly Pro Leu Val Thr Arg Asp Leu Lys Asp Thr Trp Tyr Leu He 
195 200 205 

Gly He Val Ser Trp Gly Asp Asn Cys Gly Gin Lys Asp Lys Pro Gly 
210 215 220 

Val Tyr Thr Gin Val Thr Tyr Tyr Arg Asn Trp He Ala Ser Lys Thr 
225 230 235 240 

Gly He 
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<210> 19 MTSP12 
<211> 3316 
<212> DNA 
<213> Homo Sapien 

<220> 

<221> CDS 

<222> (1)...{3282) 

<223> Nucleotide sequence encoding MTSP12, including 
MTSP12-PD1, m'SP12-PD2, Eind MTSP12-PD3 protease 
domains 

<400> 19 

atg gag ccc act gtg get aac gta cac etc gtg ccc agg aca acc aag 
Met Glu Pro Thr Val Ala Asn Val His Leu Val Pro Arg Thr Thr Lys 
15 10 15 



48 



gaa gtc ccc get ctg gat gcc gcg tgc tgt cga geg gcc acc att ggc 96 

Glu Val Pro Ala Leu Asp XLa Ala Cys Cys Arg Ala Ala Thr lie Gly 

20 25 30 

gtg gtg gcc acc age ett gte gtc etc acc ctg gga gtc ctt ttg gcc 144 

Val Val Ala Tin: Ser Leu Val Val Leu Thr Leu Gly Val Leu Leu Ala 
35 40 45 

tte etc tct aca cag ggc ttc eac gtg gac cac acg gcc gag ctg egg 192 

Phe Leu Ser Thr Gin Gly Phe His Val Asp His Thr Ala Glu Leu Arg 
SO 55 60 



gga ate egg tgg acc age agt ttg egg egg gag acc teg gac tat cac 
Gly He Arg Trp Thr Ser Ser Leu Arg Arg Glu Thr Ser Asp Tyx Has 
65 70 75 80 

cgc acg ctg acg ccc ace ctg gag gca ctg ttt gta agt agt ttt cag 
Arg Thr Leu Thr Pro Thr Leu Glu Ala Leu Phe Val Ser Ser Phe Gin 

85 90 95 



240 



288 



aag aca gag tta gag gca age tgc gtg ggt tgc teg gta ctg aat tat 336 

Lys Thr Glu Leu Glu Ala Ser Cys Val Gly Cys Ser Val Leu Asn Tyr 
100 105 110 

agg gat ggg aac tee agt gtc etc gta cat ttc cag ctg cac ttt ctg 384 

Ara Asp Gly Asn Ser Ser Val Leu Val His Phe Gin Leu His Phe Leu 

111 120 125 

ctg cga ccc etc cag acg ctg age ctg ggc ctg gag gag gag eta ttg 432 

Leu Arg Pro Leu Gin Thr- Leu Ser Leu Gly Leu Glu Glu Glu Leu Leu 
130 135 140 

cag cga ggg ate egg gca agg ctg egg gag cac ggc ate tec ctg get 

Gin Arg Gly He Arg Ala Arg Leu Arg Glu His Gly He Ser Leu Ala 
145 150 155 160 



480 



gcc tat ggc aca att gtg teg get gag etc aca ggg aga cat aag ggg 528 
Ala Tyr Gly Thr He Val Ser Ala Glu Leu Thr Gly Arg His Lys Gly 

165 170 175 

ccc ttg gca gaa aga gac ttc aaa tea ggc cgc tgt cca ggg aac tec 576 
Pro Leu Ala Glu Arg Asp Phe Lys Ser Gly Arg Cys Pro Gly Asn Ser 
180 185 190 
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ttt tec tgc ggg aac age cag tgt gtg acc aag gtg aac ccg gag tgt 624 
Phe Ser Cys Gly Asn Ser Gin Cys Val Thr Lys Val Asn Pro Glu Cys 
195 200 205 

gac gac cag g^g gac tgc tec gat ggg tec gac gag gcg cac tgc gag 672 
Asp Asp Gin Glu Asp Cys Ser Asp Gly Ser Asp Glu Ala His Cys Glu 
210 215 220 



tgt ggc ttg cag cct gcc tgg agg atg gcc ggc agg ate gtg ggc ggc 
Cys Gly Leu Gin Pro Ala Trp Arg Met Ala Gly Arg lie Val Gly Gly 
225 230 235 240 



gtg tct get get cac tgc ttc aat gag ttc caa gac ccg acg aag tgg 
Val Ser Ala Ala His Cys Phe Asn Glu Phe Gin Asp Pro Thr Lys Trp 
275 280 285 



aag gag gac ttc ctg gtc aag cca ggg gtg ctg cag aaa gcc act gtg 
Lys Glu Asp Phe Leu Val Lys Pro Gly Val Leu Gin Lys Ala Thr Val 
370 375 380 

gag ctg ctg gac cag gca ctg tgt gcc age ttg tac ggc cat tea etc 
Glu Leu Leu Asp Gin Ala Leu Cys Ala Ser Leu Tyr Gly His Ser Leu 
385 390 395 400 

act gac agg atg gtg tgc get ggc tac ctg gac ggg aag gtg gac tec 
Thr Asp Arg Met Val Cys Ala Gly Tyr Leu Asp Gly Lys Val Asp Ser 

405 410 415 



720 



atg gaa gca tec ccg ggg gag ttt ccg tgg caa gcc age ctt ega gag 768 
Met Glu Ala Ser Pro Gly Glu Phe Pro Trp Gin Ala Ser Leu Arg Glu 

245 250 255 

aac aag gag cac tte tgt ggg gcc gcc ate ate aac gcc agg tgg ctg 816 
Asn Lys Glu His Phe Cys Gly Ala Ala He He Aan Ala Arg Trp Leu 
260 265 270 



864 



960 



gtg gcc tac gtg ggt gcg acc tac etc age ggc teg gag gee age acc 912 
Val Ala Tyr Val Gly Ala Thr Tyr Leu Ser Gly Ser Glu Ala Ser Thr 
290 295 300 

gtg egg gcc cag gtg gtc cag ate gtc aag cac cec ctg tac aac gcg 
Val Arg Ala Gin Val Val Gin He Val Lys His Pro Leu Tyr Asn Ala 
305 310 315 320 

gac acg gcc gac ttt gac gtg get gtg ctg gag ctg acc age cct ctg X008 
Asp Thr Ala Asp Phe Asp Val Ala Val Leu Glu Leu Thr Ser Pro Leu 

325 330 335 

cct ttc ggc egg cac ate cag cec gtg tgc etc ccg get gcc aca cac 1056 
Pro Phe Gly Arg His He Gin Pro Val Cys Leu Pro Ala Ala Thr His 
340 345 350 

ate ttc cca cec age aag aag tgc ctg ate tea ggc tgg ggc tac etc 1104 
He Phe Pro Pro Ser Lys Lys Cys Leu He Ser Gly Trp Gly Tyr Leu 
355 360 365 



1152 



1200 



1248 



tgc cag ggt gac tea gga gga cec ctg gtc tgc gag gag cec tct ggc 12 96 

Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys Glu Glu Pro Ser Gly 
420 425 430 

egg ttc tct ctg get ggc ate gtg age tgg gga ate ggg tgt gcg gaa 1344 



1 
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ttc gac ctg get gtc ctg gag ctg gcc age ccc ctg gcc ttc aac aaa 
Phe Asp Leu Ala Val Leu Glu Leu Ala Ser Pro Leu Ala Phe Asn Lys 
625 630 635 640 

tac ate cag cct gtc tgc ctg ccc ctg gcc ate egg aag ttc cct gtg 
Tyr He Gin Pro Val Cys Leu Pro Leu Ala He Arg Lys Phe Pro Val 

645 650 655 



-35 



Arg Phe Ser Leu Ala Gly He Val Ser Trp Gly He Gly Cys Ala Glu 
435 440 445 

gcc egg cgt cca ggg gtc tat gcc cga gtc ace agg eta cgt gac tgg 1392 
Ala Arg Arg Pro Gly Val Tyr Ala Arg Val Thr Arg Leu Arg Asp Trp 
450 455 460 

ate ctg gag gcc ace acc aaa gcc age atg cct ctg gcc ccc acc atg 1440 
He Leu Glu Ala Thr Thr Lys Ala Ser Met Pro Leu Ala Pro Thr Met 
465 470 475 480 

get cct gee cct gcc gcc ccc age aca gee tgg eee acc agt cct gag 1488 

Ala Pro Ala Pro Ala Ala Pro' Ser Thr Ala Trp Pro Thr Ser Pro Glu 

485 490 495 

u 

age cct gtt gtc age acc ccc acc aaa teg atg cag gee etc agt ace 1536 

Ser Pro Val Val Ser Thr Pro Thr Lys Ser Met Gin Ala Leu Ser Thr 
500 505 SIO 

gtg cct ctt gac tgg gtc acc gtt cct aag eta caa gaa tgt ggg gee 1584 
Val Pro Leu Asp Trp Val Thr Val Pro Lys Leu Gin Glu Cys Gly Ala 
515 520 525 

agg cct gca atg gag aag ccc acc egg gtc gtg ggc ggg ttc gga get 
Arg Pro Ala Met Glu Lys Pro Thr Arg Val Val Gly Gly Phe Gly Ala 
530 535 540 

gcc tec ggg gag gtg ccc tgg cag gtc age ctg aag gaa ggg tec egg 
Ala Ser Gly Glu Val Pro Trp Gin Val Ser Leu Lys Glu Gly Ser Arg 
545 550 555 560 

eac ttc tgc gga gca act gtg gtg ggg gac cgc tgg ctg ctg tet gcc 1728 
His Phe Cys Gly Ala Thr Val Val Gly Asp Arg Trp Leu Leu Ser Ala 

565 570 575 

gee cae tgc ttc aac cac acg aag gtg gag cag gtt egg gcc cac ctg 1776 
Ala His Cys Phe Asn His Thr Lye Val Glu Gin Val Arg Ala His Leu 
580 585 590 

ggc act gcg tec etc ctg ggc ctg ggc ggg age ceg gtg aag ate ggg 1824 
Gly Thr Ala Ser Leu Leu Gly Leu Gly Gly Ser Pro Val Lys He Gly 
595 600 605 

ctg egg egg gta gtg ctg cae ccc etc tac aac cct ggc ate ctg gac 1872 
Leu Arg Arg Val Val Leu His Pro Leu Tyr Asn Pro Gly He Leu Asp 
610 615 620 



1632 



1680 



1920 



1968 



ggc egg aag tgc atg ate tec gga tgg gga aat acg cag gaa gga aat 2016 
Gly Arg Lys Cys Met He Ser Gly Trp Gly Asn Thr Gin Glu Gly Asn 
660 665 670 

gcc acc aag ccc gag etc ctg cag aag gcg tec gtg ggc ate at a gac 2064 
Ala Thr Lye Pro Glu Leu Leu Gin Lys Ala Ser Val Gly He He Asp 
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675 680 6B5 



ctg gcc acc acc age ccc agg acg aca get ggc etc aea gtc ccg ggg 
Leu Ala Thr Thr Ser Pro Arg Thr Thr Ala Gly Leu Thr Val Pro Gly 
785 790 795 8O0 

gcc aca ccc age aga ccc acc cct ggg get gcc age agg gtg acg ggc 
Ala Thr Pro Ser Arg Pro Thr Pro Gly Ala Ala Ser Arg Val Thr Gly 

805 810 815 



cag aaa acc tgt agt gtg etc tac aac ttc tec etc aca gac egc atg 2112 
Gin Lys Thr Cys Ser Val Leu Tyr Asn Phe Ser Leu Thr Asp Arg Met 
690 695 700 

ate tgc gca ggc ttc ctg gaa ggc aaa gtc gac tec tgc cag ggt gac 2160 
He Cys Ala Gly Phe Leu Glu Gly Lys Val Asp Ser Cys Gin Gly Asp 
705 710 715 720 

tct ggg ggc ccc ctg gcc tgc gag gag gcc cct ggc gtg ttt tat ctg 2208 
Ser Glv Glv Pro Leu Ala Cys Glu Glu Ala Pro Gly Val Phe Tyr Leu 

725 730 735 

gca ggg ate gtg age tgg ggt att ggc tgc get cag gtt aag aag ccg 2256 
Ala Gly He Val Ser Trp Gly He Gly Cys Ala Gin Val Lys Lye Pro 
740 745 750 

ggc gtg tac acg cgc ate ace agg eta aag ggc tgg ate ctg gag ate 2304 
Gly Val Tyr Thr Arg He Thr Arg Leu Lys Gly Trp He Leu Glu He 
755 760 765 

atg tec tec cag ccc ctt ccc atg tct ccc ccc teg ace aca agg atg 2352 
Met Ser Ser Gin Pro Leu Pro Met Ser Pro Pro Ser Thr Thr Arg Met 
770 775 780 



2400 



2446 



2496 



2544 



caa cct gcc aac tea acc tta tct gee gtg age acc act get agg gga 
Gin Pro Ala Asn Ser Thr Leu Ser Ala Val Ser Thr Thr Ala Arg Gly 
820 825 830 

cag acg eca ttt cca gac gcc ccg gag gcc acc aca cac ace cag eta 
Gin Thr Pro Phe Pro Asp Ala Pro Glu Ala Thr Thr His Thr Gin Leu' 
835 840 845 

cca gac tgt ggc ctg geg ccg gcc gcg etc acc agg att gtg ggc ggc 2592 
Pro Asp Cys Gly Leu Ala Pro Ala Ala Leu Thr Arg He Val Gly Gly 
850 855 860 

age gca gcg ggc cgt ggg gag tgg ccg tgg cag gtg gge ctg tgg ctg 2 640 
Ser Ala Ala Gly Arg Gly Glu Trp Pro Trp Gin Val Gly Leu Trp Leu 
865 870 875 880 

egg cgc egg gaa eac cgt tgc ggg gcc gtg ctg gtg gca gag agg tgg 
Atq Arg Arg Glu His Arg Cys Gly Ala Val Leu Val Ala Glu Arg Trp 

885 890 895 

ctg ctg teg gcg gcg cac tgc ttc gac gtc tac ggg gac ccc aag cag 
Leu Leu Ser Ala Ala Hie Cys Phe Asp Val Tyr Gly Asp Pro Lys Gin 
900 905 910 

tgg gcg gcc ttc eta ggc acg ccg ttc ctg age ggc gcg gag ggg cag 2784 
TrD Ala Ala Phe Leu Gly Thr Pro Phe Leu Ser Gly Ala Glu Gly Gin 
915 920 925 



2686 



2736 
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ctg gag cgc gtg gcg cgc ate tac aag cac ccg ttc tac aat etc tac 2832 
Leu Glu Arg Val Ala Arg lie Tyr Lys His Pro Phe Tyx Asn Leu Tyr 
930 935 940 



acgt 

<210> 20 

<211> 1093 

<212> PRT 

<213> Homo Sapien 

<400> 20 

Met Glu Pro Thr Val Ala Asn Val His Leu Val Pro Arg Thr Thr Lys 

15 10 15 

Glu Val Pro Ala Leu Asp Ala Ala Cys Cys Arg Ala Ala Thr lie Gly 

20 25 30 

Val Val Ala Thr Ser Leu Val Val Leu Thr Leu Gly Val Leu Leu Ala 

35 40 45 

Phe Leu Ser Thr Gin Gly Phe His Val Asp His Thr Ala Glu Leu Arg 
50 55 60 



2880 



2928 



acg etc gac tac gac gtg gcg ctt ctg gag ctg gcg ggg ccg gtg cgt 
Thr Leu Asp Tyr Asp Val Ala Leu Leu Glu Leu Ala Gly Pro Val Arg 
945 950 955 960 

cgc age cgc ctg gtg cgt ccc ate tgc ctg ccc gag ccc gcg ccg cga 
Arg Ser Arg Leu Val Arg Pro lie Cys Leu Pro Glu Pro Ala Pro Arg 

965 970 975 

ccc ccg gac ggc acg cgc tgc gtc ate ace ggc tgg ggc teg gtg cgc 2976 
Pro Pro Asp Gly Thr Arg Cys Val He Thr Gly Trp Gly Ser Val Arg 
980 985 990 

gaa gga ggc tec atg gcg egg cag ctg cag aag gcg gcc gtg cgc etc 3024 
Glu Gly Gly Ser Met Ala Arg Gin Leu Gin Lys Ala Ala Val Arg Leu 
995 1000 1005 

etc age gag cag acc tgc cgc cgc ttc tac cea gtg eag ate age age 3 072 

Leu Ser Glu Gin Thr Cys Arg Arg Phe Tyr Pro Val Gin He Ser Ser 
1010 1015 1020 

cgc atg ctg tgt gee ggc ttc ccg cag ggt ggc gtg gac age tgc teg 3120 
Arg Met Leu Cys Ala Gly Phe Pro Gin Gly Gly Val Asp Ser Cys Ser 
1025 1030 1035 1040 

ggt gac get ggg gga ccc ctg gee tgc agg gag ccc tct gga egg tgg 3168 
Glv ASP Ala Gly Gly Pro Leu Ala Cys Arg Glu Pro Ser Gly Arg Trp 

1045 1050 1055 

gtg eta act ggg gtc act age tgg ggc tat ggc tgt ggc egg ccc cac 3216 
Val Leu Thr Gly Val Thr Ser Trp Gly Tyr Gly Cys Gly Arg Pro His 
1060 1065 1070 

ttc cca ggt gtc tat acc egg gtg gca get gtg aga ggc tgg at a gga 3264 
Phe Pro Gly Val Tyr Thr Arg Val Ala Ala Val Arg Gly Trp He Gly 
1075 1080 1085 

cag cac ate cag gag tga ccaecacgtg actgcccagg ccgagactct 3312 
Gin His He Gin Glu * 
1090 



3316 
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Gly 


He 


Arg 


Trp 


Thr 


Ser 


Ser 


Leu Arg Arg Glu Thr 


Ser 


Asp 


Tyr 


His 


65 










70 




75 








80 


Arg 


Thr 


Leu 


Thr 


Pro 


Thr 


Leu 


Glu Ala Leu Phe Val 


Ser 


Ser 


Phe 


Gin 










85 






90 






95 




Lye 


Thr 


Glu 


Leu 


Glu 


Ala 


Ser 


Cys Val Gly Cys Ser 


Val 


Leu 


Asn 


Tyr 








100 








105 




110 






Arg 


Asp 


Gly 


Asn 


Ser 


Ser 


Val 


Leu Val His Phe Gin 


Leu 


His 


Phe 


Leu 




115 










120 


125 








Leu 


Arg 


Pro 


Leu 


Gin 


Thr 


Leu 


Ser Leu Gly Leu Glu 


Glu 


Glu 


Leu 


Leu 




130 










135 


140 










Gin 


Arg 


Gly 


He 


Arg Ala Arg 


Leu Arg Glu His Gly 


He 


Ser 


Leu 


Ala 


145 










150 




155 








160 


Ala 


Tyr 


Gly 


Thr 


He 


Val 


Ser Ala Glu Leu Thr Gly 


Arg 


His 


Lys 


Gly 






165 






170 






175 




Pro 


Leu 


Ala 


Glu 


Arg 


Asp 


Phe 


Lys Ser Gly Arg Cys 


Pro 


Gly 


Asn 


Ser 








180 








185 




190 






Phe 


Ser 


Cys 


Gly 


Asn 


Ser 


Gin 


Cys Val Thr Lys Val 


Asn 


Pro 


Glu 


Cys 






195 








200 


205 








Asp 


Asp 


Gin 


Glu 


Asp 


Cys 


Ser Asp Gly Ser Asp Glu 


Ala 


His 


Cys 


Glu 




210 










215 


220 










Cye 


Gly 


Leu 


Gin 


Pro 


Ala 


Trp Arg Met Ala Gly Arg 


He 


Val 


Gly Gly 


225 








230 




235 








240 


Met 


Glu 


Ala 


Ser 


Pro 


Gly 


Glu 


Phe Pro Trp Gin Ala 


Ser 


Leu Arg 


Glu 










245 






250 






255 




Asn 


Lys 


Glu 


His 


Phe 


Cys 


Gly Ala Ala He He Asn 


Ala 


Arg 


Trp 


Leu 






260 






265 




270 






Val 


Ser 


Ala 


Ala 


His 


Cys 


Phe 


Asn Glu Phe Gin Asp 


Pro 


Thr 


Lys 


Trp 






275 








260 


285 








Val 


Ala 


Tyr 


Val 


Gly Ala 


Thr 


Tyr Leu Ser Gly Ser 


Glu 


Ala 


Ser 


Thr 




290 








295 


300 










Val 


Arg 


Ala 


Gin 


Val 


Val 


Gin 


He Val Lys His Pro 


Leu 


Tyr 


Asn 


Ala 


305 








310 




315 








320 


A5p 


Thr 


Ala 


Asp 


Phe 


Asp 


Val 


Ala Val Leu Glu Leu 


Thr 


Ser 


Pro 


Leu 






325 






330 






335 




Pro 


Phe 


Gly 


Arg 


His 


He 


Gin 


Pro Val Cys Leu Pro 


Ala 


Ala 


Thr 


His 






340 








345 




350 






He 


Phe 


Pro 


Pro 


Ser 


Lys 


Lys 


Cys Leu He Ser Gly 


Trp 


Gly Tyr 


Leu 






355 










360 


365 








Lys 


Glu 


Asp 


Phe 


Leu 


Val 


Lys 


Pro Gly Val Leu Gin 


Lys 


Ala 


Thr 


Val 


370 








375 


380 










Glu 


Leu 


Leu 


Asp 


Gin 


Ala 


Leu 


Cys Ala Ser Leu Tyr 


Gly His 


Ser 


Leu 


3 85 








390 




3 95 








400 


Thr 


Asp 


Arg 


Met 


Val 


Cys 


Ala 


Gly Tyr Leu Asp Gly 


Lys 


Val 


Asp 


Ser 






405 






410 






415 




Cys 


Gin 


Gly 


Asp 


Ser 


Gly Gly 


Pro Leu Val Cys Glu 


Glu 


Pro 


Ser 


Gly 




420 








425 




430 






Arg 


Phe 


Ser 


Leu 


Ala Gly 


He 


Val Ser Trp Gly He 


Gly 


Cys 


Ala 


Glu 




435 










440 


445 








Ala 


Arg 


Arg 


Pro 


Gly Val 


Tyr Ala Arg Val Thr Arg 


Leu 


Arg 


Asp 


Trp 




450 








455 


460 










He 


Leu 


Glu 


Ala 


Thr 


Thr 


Lys 


Ala Ser Met Pro Leu 


Ala 


Pro 


Thr 


Met 


465 










470 




475 








480 


Ala 


Pro 


Ala 


Pro 


Ala 


Ala 


Pro 


Ser Thr Ala Trp Pro 


Thr 


Ser 


Pro 


Glu 










485 






490 






495 




Ser 


Pro 


Val 


Val 


Ser 


Thr 


Pro 


Thr Lys Ser Met Gin 


Ala 


Leu 


Ser 


Thr 








500 








505 




510 






Val 


Pro 


Leu 


Asp 


Trp 


Val 


Thr 


Val Pro Lys Leu Gin 


Glu 


Cys 


Gly Ala 






515 








520 


525 








Arg 


Pro 


Ala 


Met 


Glu 


Lys 


Pro 


Thr Arg Val Val Gly 


Gly 


Phe 


Gly Ala 


530 










535 


540 










Ala 


Ser 


Gly 


Glu 


Val 


Pro 


Trp 


Gin Val Ser Leu Lys 


Glu Gly 


Ser 


Arg 



wo 02/095007 



PCT/US02/16819 



-39- 



545 








DDL) 






555 








560 


His Phe 


Cys 


Gly Ala 


TilX 


vaJ. 


Val 


Gly Asp Arg Trp 


Leu 


Leu 


Ser 


Ala 






565 








570 






575 




Ala His 


Cys 


Phe 


Asn 


His 


Tnr 


Lye 


Val Glu Gin Val 


Arg Ala 


His 


Leu 




580 










585 




can 






Gly Thr Ala 


Ser 


Leu 


Leu 


Gly 


Leu Gly Gly Ser Pro Val 


Lys 


He 


Gly 




595 










600 




605 








Leu Arg Arg Val Val 


Leu 


His 


Pro 


Leu Tyr Asn Pro 


Gly 


J. j.e 


Leu Asp 


610 










615 




620 










Phe Asp 


Leu 


Ala 


Val 


Leu 


Glu 


Leu 


Ala Ser Pro Leu 


Ala 


Phe 


Asn 


Lys 


625 








630 






635 








640 


Tyx lie 


Gin 


Pro 


Val 


Cys 


Leu 


Pro 


Leu Ala He Arg 


Lys 


Phe 


Pro 


Val 






645 






650 






655 




Gly Arg Lys 


Cys 


Met 


He 


Ser 


Gly Trp Gly Asn Thr 


Gin 


Glu 


Gly Asn 






660 










665 




670 






Ala Thr 


Lys 
675 


Pro 


Glu 


Leu 


Leu 


Gin 
680 


Lye Ala Ser Val 


Gly 
685 


He 


He 


Asp 


Gin Lys 


Thr 


Cys 


Ser 


Val 


Leu 


Tyr 


Asn Phe Ser Leu 


Thr 


Asp 


Arg 


Met 


690 








695 




700 










lie Cys 


Ala Gly Phe 


Leu 


Glu 


Gly Lys Val Asp Ser 


Cys 


Gin 


Gly Asp 


705 








710 






715 








720 


Sex Gly 


Gly 


Pro 


Leu 


Ala 


Cys 


Glu Glu Ala Pro Gly Val 


Phe 


Tyr 


Leu 




725 






730 






735 




Ala Gly 


lie 


Val 


Ser 


Trp Gly 


He 


Gly Cys Ala Gin 


Val 


Lys 


Lys 


Pro 




740 










745 




750 




He 


Gly Val 


Tyr 


Thr 


Arg 


He 


Thr 


Arg 


Leu Lys Gly Trp 


He 


Leu 


Glu 


755 










760 




765 








Met Ser 


Ser 


Gin 


Pro 


Leu 


Pro 


Met 


Ser Pro Pro Ser 


Thr 


Thr 


Arg 


Met 


770 










775 




780 








Gly 


Leu Ala 


Thr 


Thr 


Ser 


Pro 


Arg 


Thr 


Thr Ala Gly Leu Thr Val 


Pro 


785 








790 




795 








800 


Ala Thr 


Pro 


Ser 


Arg 


Pro 


Thr 


Pro 


Gly Ala Ala Ser 


Arg 


Val 


Thr 


Gly 








805 








810 






815 


Gly 


Gin Pro 


Ala 


Asn 
820 


Ser 


Thr 


Leu 


Ser 


Ala Val Ser Thr 
825 


Thr 


Ala 
830 


Arg 


Gin Thr 


Pro 


Phe 


Pro 


Asp 


Ala 


Pro 


Glu Ala Thr Thr 


His 


Thr 


Gin 


Leu 




835 








840 




845 








Pro Asp 


Cys 


Gly Leu 


Ala 


Pro 


Ala 


Ala Leu Thr Arg 


He 


Val 


Gly 


Gly 


850 








855 




860 










Ser Ala 


Ala 


Gly Arg 


Gly 


Glu 


Trp 


Pro Trp Gin Val 


Gly 


Leu 


Trp 


Leu 


865 








870 






875 








880 


Arg Arg Arg 


Glu 


His 


Arg 


Cys 


Gly Ala Val Leu Val 


Ala 


Glu 


Arg 


Trp 








885 








890 






895 




Leu Leu 


Ser 


Ala 


Ala 


His 


Cys 


Phe Asp Val Tyr Gly Asp 


Pro 


Lys 


Gin 






900 










905 




910 






Trp Ala 


Ala 


Phe 


Leu 


Gly 


Thr 


Pro 


Phe Leu Ser Gly Ala Glu 


Gly Gin 


915 










920 




925 








Leu Glu Arg 


Val 


Ala 


Arg 


lie 


Tyr 


Lys His Pro Phe 


Tyr 


Asn 


Leu 


Tyr 


930 










935 




940 










Thr Leu Asp 


Tyr Asp 


Val 


Ala 


Leu Leu Glu Leu Ala Gly 


Pro 


Val 


Arg 


945 








950 






955 








960 


Arg Ser Arg 


Leu 


Val 


Arg 


Pro 


He 


Cys Leu Pro Glu 


Pro 


Ala 


Pro 


Arg 








965 








970 






975 




Pro Pro Asp Gly Thr 


Arg 


Cys 


Val 


He Thr Gly Trp 


Gly 


Ser 


Val 


Arg 






980 










985 




990 






Glu Gly Gly 


Ser 


Met 


Ala 


Arg 


Gin 


Leu Gin Lys Ala 


Ala 


Val 


Arg 


Leu 




995 










1000 


1005 






Leu Ser 


Glu 


Gin 


Thr 


Cys 


Arg 


Arg 


Phe Tyr Pro Val 


Gin 


He 


Ser 


Ser 


1010 








1015 


1020 








Arg Met 


Leu 


Cys 


Ala 


Gly 


Phe 


Pro 


Gin Gly Gly Val Asp 


Ser 


Cys 


Ser 


1025 






1030 




1035 








1041 
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Gly Asp Ala Gly Gly Pro Leu Ala Cys Arg Glu Pro Ser Gly Arg Trp 

1045 1050 1055 

Val Leu Thr Gly Val Ttir Ser Trp Gly Tyr Gly Cys Gly Arg Pro His 

1060 1065 1070 

Phe Pro Gly Val Tyr Thr Arg Val Ala Ala Val Arg Gly Trp lie Gly 

1075 1060 1085 

Gin His lie Gin Glu 
1090 

<210> 21 

<211> 702 

<2X2> DNA 

<213> Homo Sapien 

<220> 

<221> CDS 

<222> (1) . . . (699) 

<223> Nucleic Acid encoding protease domain of 
endotheliase 1 

<:400> 21 

agg ate gtt ggt ggg aca gaa gta gaa gag ggt gaa tgg ccc tgg cag 4 8 

Arq He Val Gly Gly Thr Glu Val Glu Glu Gly Glu Trp Pro Trp Gin 
15 10 '15 

get age ctg cag tgg gat ggg agt cat cgc tgt gga gca acc tta att 96 
Ala Ser Leu Gin Trp Asp Gly Ser His Arg Cys Gly Ala Thr Leu He 
20 25 30 

aat gcc aca tgg ctt gtg agt get get cac tgt ttt aca aca tat aag 144 
Asn Ala Thr Trp Leu Val Ser Ala Ala His Cys Phe Thr Thr Tyr Lys 
35 40 45 

aac cct gcc aga tgg act get tec ttt gga gta aca ata aaa cct teg 192 
Asn Pro Ala Arg Trp Thr Ala Ser Phe Gly Val Thr He Lys Pro Ser 



50 55 60 

aaa atg aaa egg ggt etc egg aga ata att gtc eat gaa aaa tac aaa 
LvB Met Lys Arg Gly Leu Arg Arg He He Val His Glu Lys Tyr Lys 
65 70 75 80 

eac cca tea cat gac tat gat att tct ctt gca gag ctt tct age cct 
His Pro Ser His Asp Tyr Asp He Ser Leu Ala Glu Leu Ser Ser Pro 

85 90 95 

gtt ccc tac aca aat gca gta eat aga gtt tgt etc cet gat gca tec 
Val Pro Tyr Thr Asn Ala Val His Arg Val Cys Leu Pro Asp Ala Ser 
100 105 110 



240 



288 



336 



tat gag ttt caa cca ggt gat gtg atg ttt gtg aca gga ttt gga gca 384 
Tyr Glu Phe Gin Pro Gly Asp Val Met Phe Val Thr Gly Phe Gly Ala 
115 120 125 

ctg aaa aat gat ggt tac agt caa aat cat ett ega caa gca cag gtg 432 
Leu Lys Asn Asp Gly Tyr Ser Gin Asn His Leu Arg Gin Ala Gin Val 
130 135 140 



act etc ata gac get aca act tge aat gaa cet caa get tac aat gac 
Thr Leu He Asp Ala Thr Thr Cys Asn Glu Pro Gin Ala Tyr Asn Asp 
145 150 155 160 



480 
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gcc ata act cct aga atg tta tgt get ggc tec tta gaa gga aaa aca 528 
Ala lie Thr Pro Arg Met Leu Cys Ala Gly Ser Leu Glu Gly Lys Thr 

165 170 175 

gat gca tgc cag ggt gac tct gga gga cca ctg gtt agt tea gat get 576 
Asp Ala C?ys Gin Gly Asp Ser Gly Gly Pro Leu Val Ser Ser Asp Ala 
ISO 185 190 

aga gat ate tgg tac ett get gga ata gtg age tgg gga gat gaa tgt 624 
Arg Asp lie Trp Tyr Leu Ala Gly He Val Ser Trp Gly Asp Glu Cys 
195 200 205 

gcg aaa ccc aac aag cct ggt gtt tat act aga gtt acg gcc ttg egg .672 
Ala Lys Pro Asn Lys Pro Gly Val Tyr Thr Arg Val Thr Ala Leu Arg 
210 215 220 

gac tgg att act tea aaa act ggt ate taa 702 
Aap Trp He Thr Ser Lys Thr Gly He 
225 230 

<210> 22 

<211> 233 

<212> PRT 

<213> Homo Sapien 



<220> 

<221> SITE 

<222> (1) . . . (233) 

<223> Protease domain of endotheliase 1 



Arg 
1 


He 


Val 


Gly 


Gly 
5 


Thr 


Glu 


Val 


Glu 


Glu Gly Glu Trp 
10 


Pro 


Trp 
15 


Gin 


Ala 


Ser 


Leu 


Gin 


Trp 


Asp 


Gly 


Ser 


His 


Tlrg Cys 


Gly Ala 


Thr 


Leu 


He 








20 








25 






30 






Asn 


Ala 


Thr 


Trp 


Leu 


Val 


Ser 


Ala 


Ala 


His Cys 


Phe Thr 


Thr 


Tyr 


Lys 






35 








40 






45 








Asn 


Pro 


Ala 


Arg 


Trp 


Thr 


Ala 


Ser 


Phe 


Gly Val 


Thr He 


Lys 


Pro 


Ser 




SO 








55 








60 








Lys 
65 


Met 


Lys 


Arg 


Gly 


Leu 
70 


Arg 


Arg 


He 


He Val 
75 


His Glu 


Lys 


Tyr 


Lys 
80 


His 


Pro 


Ser 


His 


Asp 
85 


Tyr 


Asp 


He 


Ser 


Leu Ala 
90 


Glu Leu 


Ser 


Ser 
95 


Pro 


Val 


Pro 


Tyr 


Thr 


Asn 


Ala 


Val 


His 


Arg 


Val Cys 


Leu Pro 


Asp 


Ala 


Ser 






100 










105 






110 






Tyr 


Glu 


Phe 


Gin 


Pro 


Gly 


Asp 


Val 


Met 


Phe Val 


Thr Gly Phe 


Gly 


Ala 




115 










120 






125 






Val 


Leu 


Lys 
130 


Asn 


Asp 


Gly 


Tyr 


Ser 
135 


Gin 


Asn 


His Leu 


Arg Gin 
140 


Ala 


Gin 


Thr 


Leu 


He 


Asp 


Ala 


Thr 


Thr 


Cys 


Asn 


Glu Pro 


Gin Ala 


Tyr 


Asn 


Asp 


145 








150 








155 








160 


Ala 


He 


Thr 


Pro 


Arg 


Met 


Leu 


Cys 


Ala 


Gly Ser 


Leu Glu Gly 


Lys 


Thr 








165 










170 






175 




Asp 


Ala 


Cys 


Gin 


Gly 


Asp 


Ser 


Gly 


Gly 


Pro Leu 


Val Ser 


Ser Asp 


Ala 




180 










185 






190 


Glu 




Arg 


Asp 


He 


Trp 


Tyr 


Leu 


Ala 


Gly 


He 


Val Ser Trp Gly Asp 


Cys 


195 








200 














Ala 


Lys 


Pro 


Asn 


Lys 


Pro 


Gly 


Val 


Tyr 


Thr Arg Val Thr 


Ala 


Leu 


Arg 




210 










215 








220 








Asp 


Trp 


He 


Thr 


Ser 


Lys 


Thr 


Gly 


He 
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225 230 



<210> 23 

<211> 1689 

<212> DNA 

<213> Homo Sapien 



<220> 

<221> CDS 

<222> (1) . . . (1689) 

<223> Nucleic acid encoding Endotheliase 2-S protein 
<400> 23 

atg gag agg gac age cac ggg aat gca tct cca gca aga aca cct tea 
Met Glu Arg Asp Ser His Gly Asn Ala Ser Pro Ala Arg Thr Pro Ser 
15 10 15 

get gga gca tct cca gcc cag gca tct cca get ggg aca cct cca ggc 
Ala Gly Ala Ser Pro Ala Gin Ala Ser Pro Ala Gly Thr Pro Pro Gly 
20 25 30 



48 



96 



egg gca tct cca gcc cag gca tct cca gcc cag gca tct cca get ggg 144 
Arg Ala Ser Pro Ala Gin Ala Ser Pro Ala Gin Ala Ser Pro Ala Gly 
35 40 45 

aca cct ccg ggc egg gca tct cca gcc cag gca tct cca get ggt aca 192 
Thr Pro Pro Gly Arg TVla Ser Pro Ala Gin Ala Ser Pro Ala Gly Thr 
50 55 60 

cct cca ggc egg gca tct cca ggc egg gca tct cca gcc cag gca tct 240 
Pro Pro Gly Arg Ala Ser Pro Gly Arg Ala Ser Pro Ala Gin Ala Ser 
65 70 75 80 

cca gcc egg gca tct ccg get ctg gca tea ctt tec agg tec tea tec 
Pro Ala Arg Ala Ser Pro Ala Leu Ala Ser Leu Ser Arg Ser Ser Ser 

85 90 9S 

ggc agg tea tea tec gcc agg tea gcc teg gtg aca acc tec cca acc 
Glv Arg Ser Ser Ser Ala Arg Ser Ala Ser Val Thr Thr Ser Pro Thr 
100 105 110 

aga gtg tac ctt gtt aga gca aca cca gtg ggg get gta ccc ate cga 384 
Arg Val Tyr Leu Val Arg Ala Thr Pro Val Gly Ala Val Pro lie Arg 
115 120 125 

tea tct cct gcc agg tea gca cca gca acc agg gcc acc agg gag age 
Ser Ser Pro Ala Arg Ser Ala Pro Ala Thr Arg Ala Thr Arg Glu Ser 
130 135 140 

cca ggt acg age ctg ccc aag ttc acc tgg egg gag ggc cag aag cag 
Pro Gly Thr Ser Leu Pro Lys Phe Thr Trp Arg Glu Gly Gin Lys Gin 
145 150 155 160 



288 



336 



432 



480 



eta ccg etc ate ggg tgc gtg etc etc etc att gcc ctg gtg gtt teg 528 
Leu Pro Leu lie Gly Cys Val Leu Leu Leu lie Ala Leu Val Val Ser 

165 170 175 

etc ate ate etc ttc cag ttc tgg cag ggc cac aca ggg ate agg tac 576 
Leu He He Leu Phe Gin Phe Trp Gin Gly His Thr Gly He Arg Tyr 
180 185 190 
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aag gag cag agg gag age tgt ccc aag cac get gtt cgc tgt gac ggg 624 

Lys Glu Gin Arg Glu Ser Cys Pro Lys His Ala Val Arg Cys Asp Gly 

195 200 205 

gtg gtg gac tgc aag ctg aag agt gac gag ctg ggc tgc gtg agg ttt 672 

Val Val Asp Cys Lys Leu Lys Ser Asp Glu Leu Gly Cys Val Arg Phe 
210 215 220 

gac tgg gac aag tct ctg ctt aaa ate tac tet ggg tec tec cat cag 72 0 

Asp Trp Asp Lys Ser Leu Leu Lys lie Tyr Ser Gly Ser Ser His Gin 
225 230 235 240 

tgg ctt ccc ate tgt age age aac tgg aat gac tec tac tea gag aag 766 

Trp Leu Pro He Cys Ser Ser Asn Trp Asn Asp Ser Tyr Ser Glu Lys 

245 250 255 

ace tgc cag cag ctg ggt ttc gag agt get cac egg aea ace gag gtt 816 

Thr Cys Gin Gin Leu Gly Phe Glu Ser Ala His Arg Thr Thr Glu Val 
260 265 270 

gee cac agg gat ttt gee aac age ttc tea ate ttg aga tac aac tec 864 

Ala His Arg Asp Phe Ala Asn Ser Phe Ser He Leu Arg Tyr Asn Ser 

275 280 285 

ace ate cag gaa age etc cac agg tct gaa tgc cct tec cag egg tat 912 

Thr He Gin Glu Ser Leu His Arg Ser Glu Cys Pro Ser Gin Arg Tyr 
290 295 300 

ate tec etc cag tgt tee cac tgc gga ctg agg gee atg ace ggg egg 960 

He Ser Leu Gin Cys Ser His Cys Gly Leu Arg Ala Met Thr Gly Arg 
305 310 315 320 

ate gtg gga ggg gcg ctg gee teg gat age aag tgg cct tgg eaa gtg 1008 

He Val Gly Gly Ala Leu Ala Ser Asp Ser Lys Trp Pro Trp Gin Val 

325 330 335 

agt ctg cac ttc ggc acc ace cac ate tgt gga ggc acg etc att gac 1056 

Ser Leu His Phe Gly Thr Thr His He Cys Gly Gly Thr Leu He Asp 
340 345 350 

gcc cag tgg gtg etc act gcc gee cac tgc ttc ttc gtg acc egg gag 1104 

Ala Gin Trp Val Leu Thr Ala Ala His Cys Phe Phe Val Thr Arg Glu 

355 360 365 

aag gte ctg gag ggc tgg aag gtg tac gcg ggc acc age aac ctg cac 1152 

Lys Val Leu Glu Gly Trp Lys Val Tyr Ala Gly Thr Ser Asn Leu His 
370 375 380 

cag ttg cct gag gca gcc tec att gee gag ate ate ate aac age aat ' 1200 

Gin Leu Pro Glu Ala Ala Ser He Ala Glu He He He Asn Ser Asn 
385 390 395 400 

tac acc gat gag gag gac gac tat gac ate gcc etc atg egg ctg tec 1248 

Tyr Thr Asp Glu Glu Asp Asp Tyr Asp He Ala Leu Met Arg Leu Ser 

405 410 415 

aag ccc ctg acc ctg tec get cac ate cac cct get tgc etc ccc atg 1296 

Lys Pro Leu Thr Leu Ser Ala His He His Pro Ala Cys Leu Pro Met 
420 425 430 
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gtt ctt ccc tgg att tac age aag atg gag age gag gtg cga ttc ata 
Val Leu Pro Trp lie Tyr Ser Lys Met Glu Ser Glu Val Arg Phe He 
545 S50 555 560 

aaa tec taa 
Ijys Ser * 

<210> 24 

<211> 562 

<212> PRT 

<213> homo sapien 

<220> 

<221> protease domain of endotheliase 2 
<222> (321) . . (562) 



65 
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20 










25 
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50 
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70 








75 








80 
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Ser 


Pro 
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Ser Leu Ser 


Arg 


Ser 


Ser 


Ser 






85 










90 






95 




Arg 


Ser 


Ser 


Ser 


Ala 


Arg 


Ser 


Ala 


Ser Val Thr 


Thr 


Ser 


Pro 


Thr 




100 










105 






110 






Val 


Tyr 
115 


Leu 


Val 


Arg 


Ala 


Thr 
120 


Pro 


Val Gly Ala 


Val 
125 


Pro 


He 


Arg 



1440 



1488 



cat gga cag acc ttt age etc aat gag aec tgc tgg ate aca ggc ttt 1344 
His Gly Gin Thr Phe Ser Leu Asn Glu Thr Cys Trp He Thr Gly Phe 
435 440 445 

ggc aag acc agg gag aca gat gac aag aca tec ccc ttc etc egg gag 13 92 
Gly Lys Thr Arg Glu Thr Asp Asp Lys Thr Ser Pro Phe Leu Arg Glu 
450 455 460 

gtg cag gtc aat etc ate gac ttc aag aaa tgc aat gac tac ttg gtc 
Val Gin Val Asn Leu He Asp Phe Lys Lys Cys Asn Asp Tyr Leu Val 
465 470 475 480 

tat gac agt tac ctt acc cca agg atg atg tgt get ggg gac ctt cgt 
Tyr Asp Ser Tyr Leu Thr Pro Arg Met Met Cys Ala Gly Asp Leu Arg 

485 490 495 

ggg ggc aga gac tec tgc cag gga gac age ggg ggg cct ctt gtc tgt 1536 
Gly Gly Arg Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys 
500 505 510 

gag cag aac aac cgc tgg tac ctg gca ggt gtc acc age tgg ggc aca 1584 
Glu Gin Asn Asn Arg Trp Tyr Leu Ala Gly Val Thr Ser Trp Gly Thr 
515 520 525 

ggc tgt ggc cag aga aac aaa cct ggt gtg tac acc aaa gtg aca gaa 1632 
Glv Cvs Gly Gin Arg Asn Lys Pro Gly Val Tyr Thr Lys Val Thr Glu 
530 535 540 



1680 



1689 
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Ser Ser Pro Ala Arg Ser Ala Pro Ala Thr Arg Ala Thr Arg Glu Ser 

130 135 140 

Pro Gly Thr Ser Leu Pro Lys Phe Thr Trp Arg Glu Gly Gin Lys Gin 
145 150 155 160 

Leu Pro Leu lie Gly Cys Val Leu Leu Leu lie Ala Leu Val Val Ser 

165 170 175 

Leu lie He Leu Phe Gin Phe Trp Gin Gly His Thr Gly He Arg Tyr 

180 185 190 

Lys Glu Gin Arg Glu Ser Cys Pro Lys His Ala Val Arg Cys Asp Gly 

195 200 205 

Val Val Asp Cys Lys Leu Lys Ser Asp Glu Leu Gly Cys Val Arg Phe 

210 215 220 

Asp Trp Asp Lys Ser Leu Leu Lys He Tyr Ser Gly Ser Ser His Gin 
225 230 235 240 

Trp Leu Pro He Cys Ser Ser Asn Trp Asn Asp Ser Tyr Ser Glu Lys 

245 250 255 

Thr Cys Gin Gin Leu Gly Phe Glu Ser Ala His Arg Thr Thr Glu Val 

260 265 270 

Ala His Arg Asp Phe Ala Asn Ser Phe Ser He Leu Arg Tyr Asn Ser 

275 280 285 

Thr He Gin Glu Ser Leu His Arg Ser Glu Cys Pro Ser Gin Arg Tyr 

290 295 300 

He Ser Leu Gin Cys Ser His Cys Gly Leu Arg Ala Met Thr Gly Arg 
305 310 315 320 

He Val Gly Gly Ala Leu Ala Ser Asp Ser Lys Trp Pro Trp Gin Val 

325 330 335 

Ser Leu His Phe Gly Thr Thr His He Cys Gly Gly Thr Leu He Asp 

340 345 350 

Ala Gin Trp Val Leu Thr Ala Ala His Cys Phe Phe Val Thr Arg Glu 

355 360 365 

Lys Val Leu Glu Gly Trp Lys Val Tyr Ala Gly Thr Ser Asn Leu His 

370 375 380 

Gin Leu Pro Glu Ala Ala Ser He Ala Glu He He He Asn Ser Asn 
385 390 395 400 

Tyr Thr Asp Glu Glu Asp Asp Tyr Asp He Ala Leu Met Arg Leu Ser 

405 , 410 415 

Lys Pro Leu Thr Leu Ser Ala His' He His Pro Ala Cys Leu Pro Met 

420 425 430 

His Gly Gin Thr Phe Ser Leu Asn Glu Thr Cys Trp He Thr Gly Phe 

435 440 445 

Gly Lys Thr Arg Glu Thr Asp Asp Lys Thr Ser Pro Phe Leu Arg Glu 

450 455 460 

Val Gin Val Asn Leu He Asp Phe Lys Lys Cys Asn Asp Tyr Leu Val 
465 470 475 480 

Tyr Asp Ser Tyr Leu Thr Pro Arg Met Met Cys Ala Gly Asp Leu Arg 

485 490 495 

Gly Gly Arg Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys 

500 505 510 

Glu Gin Asn Asn Arg Trp Tyr Leu Ala Gly Val Thr Ser Trp Gly Thr 

515 520 525 

Gly Cys Gly Gin Arg Asn Lys Pro Gly Val Tyr Thr Lys Val Thr Glu 

530 535 540 

Val Leu Pro Trp He Tyr Ser Lys Met Glu Ser Glu Val Arg Phe He 
545 550 555 560 

Lys Ser 



<210> 25 
<211> 2067 
<212> DNA 
<213> Homo Sapien 
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<220> 

<221> CDS 

<222> (1) . . . (2067) 

<223> Nucleic acid encoding (endotheliase 2-L) protein 
<400> 25 

atg gag agg gac age cac ggg aat gca tct cca gca aga aca cot tea 
Met Glu Arg Asp Ser His Gly Asn Ala Ser Pro 7U.a Arg Thr Pro Ser 
15 10 IS 

get gga gca tct cca gcc cag gca tct cca get ggg aca cct cca ggc 
Ala Gly Ala Ser Pro Ala Gin Ala Ser Pro Ala Gly Thr Pro Pro Gly 
20 25 30 



cct cca ggc egg gca tct cca ggc egg gca tct cca gcc cag gca tct 
Pro Pro Gly Arg Ala Ser Pro Gly Arg Ala Ser Pro Ala Gin Ala Ser 

70 75 80 



65 



48 



96 



egg gca tct cca gcc cag gca tct cca gcc cag gca tct cca get ggg 144 
Arg Ala Ser Pro Ala Gin Ala Ser Pro Ala Gin Ala Ser Pro Ala Gly 
35 40 45 

aca cct ccg ggc egg gca tct cca gcc cag gca tct cca get ggt aca 192 
Thr Pro Pro Gly Arg Ala Ser Pro Ala Gin Ala Ser Pro Ala Gly Thr 
50 55 60 



240 



2BB 



cca gcc egg gca tct ccg get ctg gca tea ctt tec agg tec tea tec 
Pro Ala Arg Ala Ser Pro Ala Leu Ala Ser Leu Ser Arg Ser Ser Ser 

85 90 95 

gqc agg tea tea tec gcc agg tea gcc teg gtg aca acc tec cca ace 
Glv Arg Ser Ser Ser Ala Arg Ser Ala Ser Val Thr Thr Ser Pro Thr 
100 105 110 

aga gtg tac ctt gtt aga gca aca cca gtg ggg get gta ccc ate cga 
Arg Val Tyr Leu Val Arg Ala Thr Pro Val Gly Ala Val Pro lie Arg 
115 120 125 

tea tct cct gcc agg tea gca cca gca acc agg gcc acc agg gag age 
Ser Ser Pro Ala Arg Ser Ala Pro Ala Thr Arg Ala Thr Arg Glu Ser 
130 135 140 

cca ggt acg age ctg ccc aag ttc acc tgg egg gag ggc cag aag cag 
Pro Gly Thr Ser Leu Pro Lys Phe Thr Trp Arg Glu Gly Gin Lys Gin 
145 150 155 160 

eta ccg etc ate ggg tgc gtg etc etc etc att gcc ctg gtg gtt teg 
Leu Pro Leu lie Gly Cys Val Leu Leu Leu lie Ala Leu Val Val Ser 

165 170 175 

etc ate ate etc ttc cag ttc tgg cag ggc cac aca ggg ate agg tac 576 
Leu He He Leu Phe Gin Phe Trp Gin Gly His Thr Gly He Arg Tyr 
180 185 190 

aag gag cag agg gag age tgt ccc aag cac get gtt cgc tgt gac ggg 624 
Lye Glu Gin Arg Glu Ser Cys Pro Lys His Ala Val Arg Cys Asp Gly 
195 200 205 

gtg gtg gac tgc aag ctg aag agt gac gag ctg ggc tgc gtg agg ttt 672 
Val Val Asp Cys Lys Leu Lys Ser Asp Glu Leu Gly Cys Val Arg Phe 
210 215 220 



336 



3B4 



432 



480 



528 
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gac tgg gac aag tct ctg ctt aaa ate tac tct ggg tec tec cat cag 720 
Asp Trp Asp Lys Ser Leu Leu Lys lie Tyr Ser Gly Ser Ser His Gin 
225 230 235 240 

tgg ctt ccc ate tgt age age aac tgg aat gac tec tac tea gag aag 768 
Trp Leu Pro lie Cys Ser Ser Asn Trp Asn Asp Ser Tyr Ser Glu Lys 

245 250 255 

acc tgc eag cag ctg ggt ttc gag agt get cac egg aca .aec gag gtt 816 
Thr CyB Gin Gin Leu Gly Phe Glu Ser Ala His Arg Thr Thr Glu Val 
260 265 270 



gcc cac agg gat ttt gcc aac age ttc tea ate ttg aga tac aac tec 
Ala His Arg Asp Phe Ala Asn Ser Phe Ser He Leu Arg Tyr Asn Ser 
275 280 285 

acc ate cag gaa age etc cac agg tct gaa tgc ect tec cag egg tat 
Thr He Gin Glu Ser Leu His Arg Ser Glu Cys Pro Ser Gin Arg Tyr 
290 295 300 

ate tec etc cag tgt tec cac tgc gga ctg agg gcc atg ace ggg egg 
He Ser Leu Gin Cys Ser His Cys Gly Leu Arg Ala Met Thr Gly Arg 
305 310 315 320 

ate gtg gga ggg gcg ctg gee teg gat age aag tgg ect tgg caa gtg 
He Val Gly Gly Ala Leu Ala Ser Asp Ser Lys Trp Pro Trp Gin Val 

325 330 335 

agt ctg cac ttc ggc aec acc cac ate tgt gga ggc acg etc att gac 
Ser Leu His Phe Gly Thr Thr His He Cys Gly Gly Thr Leu He Asp 
340 345 350 



cag ttg ect gag gca gcc tee att gcc gag ate ate ate aac age aat 
Gin Leu Pro Glu Ala Ala Ser He Ala Glu He He He Asn Ser Asn 
305 390 395 400 



aag ccc ctg acc ctg tec get cac ate cac ect get tgc etc ccc atg 
Lys Pro Leu Thr Leu Ser Ala His He His Pro Ala Cys Leu Pro Met 
420 425 430 



ggc aag acc agg gag aca gat gac aag aca tec ccc ttc etc egg gag 
Gly Lys Thr Arg Glu Thr Asp Asp Lys Thr Ser Pro Phe Leu Arg Glu 
450 455 460 



864 



912 



960 



1008 



1056 



gcc cag tgg gtg etc act gcc gee cac tgc ttc ttc gtg ace egg gag 1104 
Ala Gin Trp Val Leu Thr Ala Ala His Cys Phe Phe Val Thr Arg Glu 
355 360 365 

aag gtc ctg gag ggc tgg aag gtg tac gcg ggc acc age aac ctg cac 1152 
Lys Val Leu Glu Gly Trp Lys Val Tyr Ala Gly Thr Ser Asn Leu His 
370 375 380 



1200 



tac aec gat gag gag gac gac tat gac ate gcc etc atg egg ctg tec 1248 
Tyr Thr Asp Glu Glu Asp Asp Tyr Asp He Ala Leu Met Arg Leu Ser 

405 410 415 



1296 



cat gga cag acc ttt age etc aat gag acc tgc tgg ate aca ggc ttt 1344 
His Gly Gin Thr Phe Ser Leu Asn Glu Thr Cys Trp He Thr Gly Phe 
435 440 445 



1392 
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gtg cag gtc aat etc ate gac ttc aag aaa tgc aat gac tac ttg gtc 1440 
Val Gin Val Asn Leu He Asp Phe Lys Lys Cys Asn Asp Tyr Leu Val 
465 470 475 480 

tat gac agt tac ctt acc cca agg atg atg tgt get ggg gae ctt cgt 1488 
Tyr Asp Ser Tyr Leu Thr Pro Arg Met Met Cya Ala Gly Asp Leu Arg 

485 490 495 

S59 99c aga gac tec tgc cag gga gac age ggg ggg cet ctt gtc tgt 1536 
Gly Gly Arg Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys 
500 505 510 

gag cag aac aac cgc tgg tac ctg gca ggt gtc acc age tgg ggc aca 1584 
Glu Gin Asn Asn Arg Trp Tyr Leu Ala Gly Val Thr Ser Trp Gly Thr 
515 520 525 

ggc tgt ggc cag aga aac aaa cct ggt gtg tac acc aaa gtg aca gaa '1632 
Gly Cys Gly Gin Arg Asn Lys Pro Gly Val Tyr Thr Lys Val Thr Glu 
530 535 540 

gtt ctt ccc tgg att tac age aag atg gag aac aga get cag egg gtt 1680 
Val Leu Pro Trp He Tyr Ser Lys Met Glu Asn Arg Ala Gin Arg Val 
545 550 555 560 

gaa aaa gcg tgg acc tac agg cca ggc agg cag ttg ctg ggc aga tgt 1728 
Glu Lys Ala Trp Thr Tyr Arg Pro Gly Arg Gin Leu Leu Gly Arg Cys 

565 570 575 

tct ccc aga agt att ttt ttg tgt aag gtt gca atg gac ttt gaa aac 1776 
Ser Pro Arg Ser He Phe Leu Cys Lys Val Ala Met Asp Phe Glu Asn 
580 585 590 



gtt tea gtt tct gca gag gat ttt gtg ata gtt ttt gtt ate aag cat 
Val Ser Val Ser Ala Glu Asp Phe Val He Val Phe Val He Lys His 
595 600 605 



aac aca agt gac tec att ttg act ctg aca act ttc aca get gtc acc 
Asn Thr Ser Asp Ser He Leu Thr Leu Thr Thr Phe Thr Ala Val Thr 

645 650 655 



1824 



tta tgc atg gga ate cgc tct tea tgg cct ttc cca get ctg ttt gtt 1872 

Leu Cys Met Gly lie Arg Ser Ser Trp Pro Phe Pro Ala Leu Phe Val 
610 615 620 

tta gtc ttt ttg att ttc ttt ttg ttg ttg ttg ttg tct ttt tta aaa 1920 

Leu Val Phe Leu He Phe Phe Leu Leu Leu Leu Leu Ser Phe Leu Lys 
625 630 635 640 



1968 



aga atg etc cct gag aac tac cat tct ttc cct ttc cca ctt aaa ata 2016 
Arg Met Leu Pro Glu Asn Tyr His Ser Phe Pro Phe Pro Leu Lys He 
660 665 670 

ttt cat cag aac etc act act ate ata aaa gag tat aaa gta ata aaa 2064 
Phe His Gin Asn Leu Thr Thr He He Lys Glu Tyr Lys Val He Lye 
675 680 685 

taa 2067 

<210> 26 
<211> 688 
<212> PRT 
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<213> Homo Sapien 
<220> 

<221> protease domain 
<222> (321) . . (68B) 
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His Gly Gin Thr Phe Ser Leu Asn Glu Thr Cys Trp lie Thr Gly Phe 

435 440 445 

Gly Lys Thx Arg Glu Thr Asp Asp Lys Thr Ser Pro Phe Leu Arg Glu 

450 455 460 

Val Gin Val Asn Leu lie Asp Phe Lys Lys Cys Asn Asp Tyr Leu Val 
465 470 475 480 

Tyr Asp Ser Tyr Leu Thr Pro Arg Met Met Cys Ala Gly Asp Leu Arg 

485 490 495 

Gly Gly Arg Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys 

500 505 510 

Glu Gin Asn Asn Arg Trp Tyr Leu Ala Gly Val Thr Ser Trp Gly Thr 

515 520 525 

Glv Cvs Gly Gin Arg Asn Lys Pro Gly Val Tyr Thr Lys Val Thr Glu 

530 535 540 

Val Leu Pro Trp lie Tyr Ser Lys Met Glu Asn Arg Ala Gin Arg Val 
545 550 555 560 

Glu Lys Ala Trp Thr Tyr Arg Pro Gly Arg Gin Leu Leu Gly Arg Cys 

565 570 575 

Ser Pro Arg Ser lie Phe Leu Cys Lys Val Ala Met Asp Phe Glu Asn 

580 585 590 

Val Ser Val Ser Ala Glu Asp Phe Val lie Val Phe Val lie Lys His 

595 600 605 

Leu Cys Met Gly lie Arg Ser Ser Trp Pro Phe Pro Ala Leu Phe Val 

610 615 620 

Leu Val Phe Leu lie Phe Phe Leu Leu Leu Leu Leu Ser Phe Leu Lys 
625 630 635 640 

Asn Thr Ser Asp Ser He Leu Thr Leu Thr Thr Phe Thr Ala Val Thr 

645 650 655 

Atq Met Leu Pro Glu Asn Tyr His Ser Phe Pro Phe Pro Leu Lys lie 

^ 660 665 670 

Phe His Gin Asn Leu Thr Thr He He Lys Glu Tyr Lys Val He Lys 
675 680 685 

<210> 27 

<211> 1471 

<212> DNA 

<213> Homo Sapien 

<220> 

<223> DESCl gene 

<221> misc_feature 
<222> (626) . . . (1324) 
<223> protease domain 

<221> CDS 

<222> (56) . . . (1324) 

<400> 27 ^ t. 

tgacttggat gtagacctcg accttcacag gactcttcat tgctggttgg caatg atg 

1 



58 



tat egg cca gat gtg gtg agg get agg aaa aga gtt tgt tgg gaa ccc 106 
Tyr Arg Pro Asp Val Val Arg Ala Arg Lys Arg Val Cys Trp Glu Pro 
S 10 15 

tqq gtt ate ggc etc gtc ats ttc ata tec ctg att gtc ctg gca gtg 154 
Trp Val He Gly Leu Val Xaa Phe He Ser Leu He Val Leu Ala Val 
20 25 30 
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tgc att gga stc act gtt cat tat gtg aga tat aat caa aag aag acc 
Cye He Gly Xaa Thr Val His Tyr Val Arg Tyr Asn Gin Lys Lys Thx 



35 40 45 



gag ttt ggc aga gag get tct aac aat ttt aca gaa atg age cag aga 
Glu Phe Gly Arg Glu Ala Ser Asn Asn Phe Thr Glu Met Ser Gin Axg 

70 75 80 



tag act get tec ttt gga gta aca ata aaa cct teg aaa atg aaa egg 
Trp Thr Ala Ser Phe Gly Val Thr He Lye Pro Ser Lys Met Lys Arg 
245 250 255 

qqt etc egg aga ata att gtc eat gaa aaa tac aaa cae cca tea cat 
GlY Leu Arg Arg He He Val His Glu Lys Tyr Lys His Pro Ser His 
260 265 270 

gac tat gat att tct ctt gca gag ctt tct age cct gtt cee tac aca 



202 



tac aat tac tat age aca ttg tea ttt aca act gac aaa eta tat get 250 
Tyr Asn Tyr Tyr Ser Thr Leu Ser Phe Thr Thr Asp Lys Leu Tyr Ala 
50 55 60 65 



29B 



ctt gaa tea atg gtg aaa aat gca ttt tat aaa tct cca tta agg gaa 346 
Leu Glu Ser Met Val Lys Asn Ala Phe Tyr Lys Ser Pro Leu Arg Glu 
85 90 95 

gaa ttt gtc aag tct cag gtt ate aag ttc agt caa cag aag cat gga 394 
Glu Phe Val Lys Ser Gin Val He Lys Phe Ser Gin Gin Lys His Gly 
100 105 HO 

gtg ttg get cat atg ctg ttg att tgt aga ttt cac tct act gag gat 442 
Val Leu Ala His Met Leu Leu He Cys Arg Phe His Ser Thr Glu Asp 
lis 120 125 

cct gaa act gta gat aaa att gtt caa ctt gtt tta cat gaa aag ctg 490 
Pro Glu Thr Val Asp Lys He Val Gin Leu Val Leu His Glu Lys Leu 
130 135 140 145 

caa gat get gta gga ecc cct aaa gta gat cct cac tea gtt aaa att 
Gin ASP Ala Val Gly Pro Pro Lys Val Asp Pro His Ser Val Lys He 

150 1S5 160 

aaa aaa ate aac aag aca gaa aca gac age tat eta aac cat tgc tgc 
Lvs Lys He Asn Lys Thr Glu Thr Asp Ser Tyr Leu Asn His Cys Cys 
165 170 175 

gga aca cga aga agt aaa act eta ggt cag agt etc agg ate gtt ggt 
Gly Thr Arg Arg Ser Lys Thr Leu Gly Gin Ser Leu Arg He Val Gly 
180 185 190 

aaa aca gaa gta gaa gag ggt gaa tgg ccc tgg cag get age ctg cag 
Gly Thr Glu Val Glu Glu Gly Glu Trp Pro Trp Gin Ala Ser Leu Gin 
195 200 205 

tqq gat ggg agt cat egc tgt gga gca acc tta att aat gee aca tgg 
Trn ASP Gly Ser His Arg Cys Gly Ala Thr Leu He Asn Ala Thr Tip 
210 215 220 225 

Ctt gtg agt get get cae tgt ttt aca aca tat aag aac cct gee aga 778 
Leu Val Ser Ala Ala His Cys Phe Thr Thr Tyr Lys Asn Pro Ala Arg 

230 235 240 - 



538 



586 



634 



682 



730 



826 



874 



922 
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Asp Tyr Asp lie Ser Leu .Ala Glu Leu Ser Ser Pro Val Pro Tyr Thr 
275 2B0 285 

aat gca gta cat aga gtt tgt etc cot gat gca tec tat gag ttt caa 970 
Asn Ala Val His Arg Val Cys Leu Pro Asp Ala Ser Tyr Glu Phe Gin 
290 295 300 305 

oca ggt gat gtg atg ttt gtg aca gga ttt gga gca ctg aaa aat gat 1018 
Pro Gly Asp Val Met Phe Val Thr Gly Phe Gly Ala Leu Lys Asn Asp 

310 315 320 

ggt tac agt caa aat cat ctt cga caa gca cag gtg act etc ata gac 1066 
Gly Tyr Ser Gin Asn His Leu Arg Gin Ala Gin Val Thr Leu lie Asp 
325 330 335 

get aca act tge aat gaa cct caa get tac aat gac gcc ata act cet 1114 
Ala Thr Thr Cys Asn Glu Pro Gin Ala Tyr Asn Asp Ala lie Thr Pro 
340 345 350 

aga atg tta tgt get ggc tee tta gaa gga aaa aca gat gca tgc cag 1162 
Arg Met Leu Cys Ala Gly Ser Leu Glu Gly Lys Thr Asp Ala Cys Gin 
355 360 365 

ggt gac tct gga gga cca ctg gtt agt tea gat get aga gat ate tgg 1210 
Gly Asp Ser Gly Gly Pro Leu Val Ser Ser Asp Ala Arg Asp lie Trp 
370 375 3B0 385 



tac ctt get gga ata gtg age tsg gga gat gaa tgt gcg aaa cec aac 
Tyr Leu Ala Gly lie Val Ser Xaa Gly Asp Glu Cys Ala Lys Pro Asn 

390 395 400 

aag cct ggt gtt tat act aga gtt acg gcc ttg egg gac tgg att act 
Lys Pro Gly Val Tyr Thr Arg Val Thr Ala Leu Arg Asp Trp He Thr 
405 410 415 



1258 



1306 



tea aaa act ggt ate taa gagagaaaag cctcatggaa cagataacat 1354 
Ser Lys Thr Gly He * 
420 

ttttttttgt tttttgggtg tggaggccat ttttagagat acagaattgg agaagaettg 1414 
caaaacagct agatttgact gatctcaata aaetgtttgc ttgatgcaaa aaaaaaa 1471 

<210> 28 
<211> 4933 
<212> DNA 
<213> Homo Sapien 

<220> 
<221> CDS 

<222> (94) . . . (3222) 

<223> Nucleotide sequence encoding corin 
<300> 

<308> GenBank AF133e45 
<309> 1999-05-24 

<400> -28 

aaatcatccg tagtgcctcc cegggggaca cgtagaggag agaaaagega ccaagataaa 60' 
agtggacaga agaataagcg agacttttta tec atg aaa cag tct cct gee etc 114 

Met Lys Gin Ser Pro Ala Leu 
1 5 
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gct ccg gaa gag cgc tac cgc aga gcc ggg tec cca aag ccg gtc ttg 162 
Ala Pro Glu Glu Arg Tyr Arg Axg Ala Gly Ser Pro Lys Pro Val Leu 
10 15 20 

aga get gat gac aat aac atg ggc aat ggc tgc tct cag aag ctg gcg 210 
Arg Ala Asp Aep Asn Asn Met Gly Asn Gly Cys Ser Gin Lys Leu Ala 
25 30 35 

act get aac etc etc egg ttc eta ttg ctg gtc ctg att cca tgt ate 258 
Thr Ala Asn Leu Leu Arg Phe Leu Leu Leu Val Leu lie Pro Cys lie 
40 45 50 55 

tgt get etc gtt etc ttg ctg gtg ate ctg ctt tec tat gtt gga aca 306 
Cys Ala Leu Val Leu Leu Leu Val lie Leu Leu Ser Tyr Val Gly Thr 

60 65 70 

tta caa aag gtc tat ttt aaa tea aat ggg agt gaa ect ttg gtc act 354 
Leu Gin Lye Val Tyr Phe Lys Ser Asn Gly Ser Glu Pro Leu Val Thr 
75 80 85 



gat ggt gaa ate caa ggg tec gat gtt att ctt aca aat aca att tat 

Asp Gly Glu lie Gin Gly. Ser Asp Val lie Leu Thr Asn Thr lie Tyr 

90 95 100 

aac cag age act gtg gtg tct act gca cat ccc gac caa cac gtt eca 

Asn Gin Ser Thr Val Val Ser Thr Ala His Pro Asp Gin His Val Pro 

105 110 115 

gcc tgg act acg gat get tct etc eca ggg gac caa agt cac agg aat 

Ala Trp Thr Thr Asp Ala Ser Leu Pro Gly Asp Gin Ser His Arg Asn 

120 125 130 135 

aca agt gee tgt atg aac ate acc cac age cag tgt cag atg ctg ccc 

Thr Ser Ala Cys Met Asn lie Thr His Ser Gin Cys Gin Met Leu Pro 

140 145 ISO 

tac cac gee acg ctg aca ect etc etc tea gtt gtc aga aac atg gaa 
Tyr His Ala Thr Leu Thr Pro Leu Leu Ser Val Val Arg Asn Met Glu 

155 160 165 

atg gaa aag ttc etc aag ttt ttc aca tat etc cat cgc etc agt tgc 

Met Glu Lys Phe Leu Lys Phe Phe Thr Tyr Leu His Arg Leu Ser Cys 

170 175 180 



tac tec tgg ccg gat ttc etc aga tgc tec cag ttt aga aac caa act 
Tyr Ser Trp Pro Asp Phe Leu Arg Cys Ser Gin Phe Arg Asn Gin Thr 
235 240 245 



402 



450 



498 



546 



594 



642 



tat caa cat ate atg ctg ttt ggc tgt acc etc gcc ttc ect gag tgc 690 

Tyr Gin His He Met Leu Phe Gly Cys Thr Leu Ala Phe Pro Glu Cys 
185 190 195 

ate att gat ggc gat gac agt cat gga etc ctg ccc tgt agg tec ttc 

He He Asp Gly Asp Asp Ser His Gly Leu Leu Pro Cys Arg Ser Phe 

200 205 210 215 



738 



tgt gag get gca aaa gaa ggc tgt gaa tea gtc ctg ggg atg gtg aat 786 
Cys Glu Ala Ala Lys Glu Gly Cys Glu Ser Val Leu Gly Met Val Asn 

220 225 230 



834 
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gaa age age aat gtc age aga att tgc ttc tea cct cag cag gaa aac 882 

Glu Ser Ser Asn Val Ser Arg lie Cys Phe Ser Pro Gin Gin Glu Asn 
250 255 260 

gga aag caa ttg etc tgt gga agg ggt gag aac ttt ctg tgt gee agt 93 0 

Gly Lys Gin Leu Leu Cys Gly Arg Gly Glu Asn Phe Leu Cys Ala Ser 

265 270 275 



gga ate tgc ate ccc ggg aaa ctg caa tgt aat ggc tac aac gac tgt 

Gly lie Cys He Pro Gly Lys Leu Gin Cys Asn Gly Tyr Asn Asp Cys 

280 285 290 295 

gac gac tgg agt gac gag get cat tgc aac tgc age gag aat ctg ttt 

Asp Asp Trp Ser Asp Glu Ala His Cys Asn Cys Ser Glu Asn Leu Phe 

300 305 310 



978 



1026 



cac tgt cac aca ggc aag tgc ctt aat tac age ctt gtg tgt gat gga 1074 
His Cys His Thr Gly Lys Cys Leu Asn Tyr Ser Leu Val Cys Asp Gly 
315 ^ 320 325 

tat gat gac tgt ggg gat ttg agt gat gag caa aac tgt gat tgc aat 1122 
Tyr Asp Asp Cys Gly Asp Leu Ser Asp Glu Gin Asn Cys Asp Cys Asn 
330 335 340 

ccc aca aca gag eat cgc tgc ggg gac ggg cgc tgc ate gee atg gag 1170 
Pro Thr Thr Glu His Arg Cys Gly Asp Gly Arg Cys He Ala Met Glu 
345 350 355 

tgg gtg tgt gat ggt gac cac gac tgt gtg gat aag tec gac gag gtc 1218 
Trp Val Cys Asp Gly Asp His Asp Cys Val Asp Lys Ser Asp Glu Val 
360 365 370 375 

aac tgc tec tgt cac age cag ggt ctg gtg gaa tgc aga aat gga caa 1266 
Asn Cys Ser Cys His Ser Gin Gly Leu Val Glu Cys Arg Asn Gly Gin 

380 385 390 

tgt ate ccc age acg ttt caa tgt gat ggt gac gag gac tgc aag gat 1314 
Cys He Pro Ser Thr Phe Gin Cys Asp Gly Asp Glu Asp Cys Lys Asp 
395 400 405 

ggg agt gat gag gag aac tgc age gtc att cag act tea tgt caa gaa 13 62 

Gly Ser Asp Glu Glu Asn Cys Ser Val He Gin Thr Ser Cys Gin Glu 
410 415 420 

gga gac caa aga tgc etc tac aat ccc tgc ctt gat tea tgt ggt ggt 1410 
Gly Asp Gin Arg Cys Leu Tyr Asn Pro Cys Leu Asp Ser Cys Gly Gly 
425 430 435 

age tct etc tgt gac ceg aac aac agt ctg aat aac tgt agt caa tgt 1458 
Ser Ser Leu Cys Asp Pro Asn Asn Ser Leu Asn Asn Cys Ser Gin Cys 
440 445 450 455 

gaa cca att aca ttg gaa etc tgc atg aat ttg ccc tac aac agt aca 1506 
Glu Pro He Thr Leu Glu Leu Cys Met Asn Leu Pro Tyr Asn Ser Thr 

460 465 470 

agt tat cca aat tat ttt ggc cac agg act caa aag gaa gca tec ate 1554 
Ser Tyr Pro Asn Tyr Phe Gly His Arg Thr Gin Lys Glu Ala Ser He 
475 480 485 



age tgg gag tct tct ctt ttc cct gca ctt gtt caa ace aac tgt tat 



1602 
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Ser Trp Glu Ser Ser Leu Phe Pro Ala Leu Val Gin Thr Asn Cys Tyr 
' 490 495 500 

aaa tac etc atg ttc ttt tct tgc acc att ttg gta cca aaa tgt gat 1650 
Lys Tyr Leu Met Phe Phe Ser Cys Thr lie Leu Val Pro Lys Cys Asp 
505 510 515 

gtg aat aca ggc gag cgt ate cct cct tgc agg gca ttg tgt gaa cac 1698 
Val Asn Thr Gly Glu Arg lie Pro Pro Cys Arg Ala Leu Cys Glu His 
520 525 530 535 

tct aaa gaa cgc tgt gag tct gtt ctt ggg att gtg ggc eta cag tgg 1746 
Ser Lys Glu Arg Cys Glu Ser Val Leu Gly He Val Gly Leu Gin Trp 

540 545 550 

cct gaa gac aca gat tgc agt caa ttt cca gag gaa aat tea gac aat 1794 
Pro Glu Asp Thr Asp Cys Ser Gin Phe Pro Glu Glu Asn Ser Asp Asn 
555 560 565 

caa acc tgc ctg atg ect gat gaa tat gtg gaa gaa tgc tea cct agt 1842 
Gin Thr Cys Leu Met Pro Asp Glu Tyr Val Glu Glu Cys Ser Pro Ser 
570 575 580 

cat ttc aag tgc cgc tea gga cag tgt gtt ctg get tec aga aga tgt 1890 
His Phe Lys Cys Arg Ser Gly Gin Cys Val Leu Ala Ser Arg Arg Cys 
585 590 595 

gat ggc cag gcc gac tgt gac gat gac agt gat gag gaa aac tgt ggt 1938 
Asp Gly Gin Ala Asp Cys Asp Asp Asp Ser Asp Glu Glu Aen Cys Gly 
600 605 610 615 

tgt aaa gag aga gat ctt tgg gaa tgt cca tec aat aaa caa tgt ttg 1986 
Cys Lys Glu Arg Asp Leu Trp Glu Cys Pro Ser Asn Lys Gin Cys Leu 

620 625 630 

aag cac aca gtg ate tgc gat ggg ttc cca gac tgc cct gat tac atg 
Lye His Thr Val lie Cys Asp Gly Phe Pro Asp Cys Pro Asp Tyr Met 
635 640 645 

gac gag aaa aac tgc tea ttt tgc caa gat gat gag ctg gaa tgt gca 
Asp Glu Lys Asn Cys Ser Phe Cys Gin Asp Asp Glu Leu Glu Cys Ala 
650 655 660 

aac cat gcg tgt gtg tea cgt gac ctg tgg tgt gat ggt gaa gcc gac 2130 
Asn His Ala Cys Val Ser Arg Asp Leu Trp Cys Asp Gly Glu Ala Asp 
665 670 675 



tgc tea gac agt tea gat gaa tgg gac tgt gtg acc etc tct ata aat 
Cys Ser Asp Ser Ser Asp Glu Trp Asp Cys Val Thr Leu Ser He Asn 
680 685 690 695 

gtg aac tec tct tec ttt ctg atg gtt cac aga get gcc aca gaa cac 
Val Asn Ser Ser Ser Phe Leu Met Val His Arg Ala Ala Thr Glu His 

700 70S 710 



2034 



2082 



2178 



2226 



eat gtg tgt* gca gat ggc tgg cag gag ata ttg agt cag ctg gcc tgc 2274 

His Val Cys Ala Asp Gly Trp Gin Glu He Leu Ser Gin Leu Ala Cys 
715 720 725 

aag cag atg ggt tta gga gaa cca tct gtg acc aaa ttg ata cag gaa 2322 

Lys Gin Met Gly Leu Gly Glu Pro Ser Val Thr Lys Leu He Gin Glu 
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730 735 740 

cag gag aaa gag ccg egg tgg ctg aca tta cac tec aac tgg gag age 2370 
Gin Glu Lya Glu Pro Arg Trp Leu Thr Leu His Ser Asn Trp Glu Ser 
745 750 755 

etc aat ggg ace act tta cat gaa ctt eta gta aat ggg cag tct tgt 2418 
Leu Asn Gly Thr Thr Leu His Glu Leu Leu Val Asn Gly Gin Ser Cys 
760 765 770 775 

gag age aga agt aaa att tct ctt ctg tgt act aaa caa gae tgt ggg 2466 
Glu Ser Arg Ser Lys lie Ser Leu Leu Cys Thr Lys Gin Asp Cys Gly 

780 785 790 

cgc cgc cct get gee cga atg aac aaa agg ate ctt gga ggt egg acg 2514 
Arg Arg Pro Ala Ala Arg Met Asn Lys Arg lie Leu Gly Gly Arg Thr 
795 BOO 805 

agt cgc eet gga agg tgg cca tgg cag tgt tct ctg cag agt gaa ccc 2562 
Ser Arg Pro Gly Arg Trp Pro Trp Gin Cys Ser Leu Gin Ser Glu Pro 
810 815 820 

agt gga cat ate tgt ggc tgt gtc etc att gee aag aag tgg gtt ctg 2610 
Ser Gly His lie Cys Gly Cys Val Leu He Ala Lys Lys Trp Val Leu 
825 830 835 

aca gtt gee cac tgc ttc gag ggg aga gag aat get gca gtt tgg aaa 2658 
Thr Val Ala His Cys Phe Glu Gly Arg Glu Asn Ala Ala Val Trp Lys 
840 845 850 855 



gtg gtg ctt ggc ate aac aat eta gae cat cca tea gtg ttc atg cag 
Val Val Leu Gly He Asn Asn Leu Asp His Pro Ser Val Phe Met Gin 

860 855 870 



gtg gtg gae tat gae ate age ate gtt gag ctg agt gaa gae ate agt 
Val Val Asp Tyr Asp He Ser He Val Glu Leu Ser Glu Asp He Ser 
890 895 900 

gag act ggc tac gtc egg cct gtc tgc ttg ccc aac ccg gag cag tgg 
Glu Thr Gly Tyr Val Arg Pro Val Cys Leu Pro Asn Pro Glu Gin Trp 
905 910 915 

eta gag cct gae acg tac tgc tat ate aca ggc tgg ggc cac atg ggc 
Leu Glu Pro Asp Thr Tyr Cys Tyr He Thr Gly Trp Gly His Met Gly 
920 925 930 935 

aat aaa atg cca ttt aag ctg caa gag gga gag gtc cgc att att tct 
Asn Lys Met Pro Phe Lys Leu Gin Glu Gly Glu Val Arg He He Ser 

940 945 950 

ctg gaa cat tgt cag tec tac ttt gae atg aag acc ate acc act egg 
Leu Glu His Cys Gin Ser Tyr Phe Asp Met Lys Thr He Thr Thr Arg 
955 960 965 

atg at a tgt get ggc tat gag tct ggc aca gtt gat tea tgc atg ggt 
Met He Cys Ala Gly Tyr Glu Ser Gly Thr Val Asp Ser Cys Met Gly 
970 975 980 



2706 



aca cgc ttt gtg aag ace ate ate ctg cat ccc cgc tac agt cga gca 2754 
Thr Arg Phe Val Lya Thr He He Leu His Pro Arg Tyr Ser Arg Ala 
875 880 885 



2802 



2850 



2898 



2946 



2994 



3042 
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gac age ggt ggg cct ctt gtt tgt gag aag cct gga gga egg tgg aca 3 090 

Asp Ser Gly Gly Pro Leu Val Cye Glu Lys Pro Gly Gly Arg Trp Thr 
985 990 995 

tta ttt gga tta act tea tgg ggc tec gtc tgc ttt tec aaa gtc ctg 3138 
Leu Phe Gly Leu Thr Ser Trp Gly Ser Val Cys Phe Ser Lys Val Leu 
1000 1005 1010 1015 

ggg cct ggc gtt tat agt aat gtg tea tat ttc gtc gaa tgg att aaa 3186 
Gly Pro Gly Val Tyr Ser Asn Val Ser Tyr Phe Val Glu Trp lie Lys 

1020 1025 1030 

aga cag att tac ate cag acc ttt etc eta aac taa ttataaggat 3232 
Arg Gin lie Tyr lie Gin Thr Phe Leu Leu Asn * 
1035 1040 

gatcagagac ttttgccage tacactaaaa gaaaatggcc ttcttgaetg tgaagagetg 3292 

cctgcagaga gctgtacaga agcacttttc atggacagaa atgctcaatc gtgcaetgca 3352 

aatttgcatg tttgttttgg actaattttt ttcaatttat ttttteaect tcatttttct 3412 

cttatttcaa gtteaatgaa agactttaca aaagcaaaca aagcagactt tgtccttttg 3472 

ccaggcctaa ccatgactgc agcacaaaat tategactct ggcgagattt aaaatcaggt 3532 

gctacagtaa caggttatgg aatggtctct tttatcctat cacaaaaaaa gacatagata 3592 

tttaggctga ttaattatct ctaccagttt ttgtttctca agctcagtgc atagtggtaa 3 652 

atttcagtgt taacattgga gacttgcttt tctttttett tttttatacc ccacaattct 3712 

tttttattac acttcgaatt ttagggtaca cgagcacaae gtgcaggtta gttaeatatg 3772 

tatacatgtg ceatgttggt gtgctgaacc eagtaactcg tcatttgatt tattaaaagc 3 832 

caagataatt tacatgttta aagtatttac tattaecccc ttctaatgtt tgcataattc 3B92 

tgagaactga taaaagacag eaataaaaga ecagtgteat ccatttaggt agcaagacat 3952 

attgaatgca aagttcttta gatatcaata ttaacacttg acattattgg accccccatt 4012 

ctggatgtat atcaagatca taattttata gaagagtetc tatagaactg tec teat age 4072 

tgggtttgtt eaggatatat gagttggctg attgagactg caacaactac atctatattt 4132 

atgggcaata ttttgtttta ettatgtggc aaagaactgg atattaaact ttgeaaaaga 4192 

gaatttagat gagagatgca attttttaaa aagaaaatta atttgcatcc ctcgtttaat 4252 

taaatttatt ttteagtttt cttgcgttca tccataccaa eaaagtcata aagagcatat 4312 

tttagagcac agtaagactt tgeatggagt aaaacatttt gtaattttcc tcaaaagatg 4372 

tttaatatct ggtttcttct cattggtaat taaaatttta gaaatgattt ttagetctag 4432 

gccactttac geaactcaat ttctgaagea attagtggta aaaagtattt ttceeeacta 4492 

aaaaacttta aaacacaaat cttcatatat acttaattta attagtcagg catceatttt 4552 

gccttttaaa caaetaggat tccctactaa cetccaccag caacctggac tgcctcagca 4612 

ttccaaatag atactaectg caattttata catgtatttt tgtatctttt ctgtgtgtaa 4672 

acatagttga aattcaaaaa gttgtagcaa tttctataet attcatctce tgtccttcag 4732 

tttgtataaa cctaaggaga gtgtgaaatc eagcaactga attgtggtca cgattgtatg 4792 

aaagttcaag aacatatgte agt ttt gtt a cagttgtagc tacatactca atgtatcaac 4852 

ttttagcetg ctcaacttag gctcagtgaa atatatatat tatacttatt ttaaataatt 4912 

cttaatacaa ataaaatggt a 4933 

<210> 29 

<211> 1042 

<212> PRT 

<213> Homo Sapien 

<400> 29 

Met Lys Gin Ser Pro Ala Leu Ala Pro Glu Glu Arg Tyr Arg Arg Ala 

15 10 15 

Gly Ser Pro Lys Pro Val Leu Arg Ala Asp Asp Asn Asn Met Gly Asn 

20 25 30 

Gly Cys Ser Gin Lys Leu Ala Thr Ala Asn Leu Leu Arg Phe Leu Leu 

35 40 45 

Leu Val Leu lie Pro Cys lie Cys Ala Leu Val Leu Leu Leu Val He 
SO 55 60 
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Leu Leu Ser Tyr Val Gly Thr Leu Gin Lys Val Tyr Phe Lys Ser Asn 
65 70 75 BO 

Gly Ser Glu Pro Leu Val Thr Asp Gly Glu He Gin Gly Ser Asp Val 

85 90 95 

He Leu Thr Asn Thr He Tyr Asn Gin Ser Thr Val Val Ser Thr Ala 

100 105 110 

His Pro Asp Gin His Val Pro Ala Trp Thr Thr Asp Ala Ser Leu Pro 

115 120 125 

Gly Asp Gin Ser Hie Arg Asn Thr Ser Ala Cys Met Asn He Thr His 

130 135 140 

Ser Gin Cys Gin Met Leu Pro Tyr His Ala Thr Leu Thr Pro Leu Leu 
145 150 155 160 

Ser Val Val Arg Asn Met Glu Met Glu Lys Phe Leu Lys Phe Phe Thr 

165 170 175 

Tvx Leu His Arg Leu Ser Cys Tyr Gin His He Met Leu Phe Gly Cys 

ISO 185 190 

Thr Leu Ala Phe Pro Glu Cys He He Asp Gly Asp Asp Ser Hxs Gly 

195 200 205 

Leu Leu Pro Cys Arg Ser Phe Cys Glu Ala Ala Lys Glu Gly Cys Glu 

210 215 220 

Ser Val Leu Gly Met Val Asn Tyr Ser Trp Pro Asp Phe Leu Arg Cys 
225 230. 235 240 

Ser Gin Phe Arg Asn Gin Thr Glu Ser Ser Asn Val Ser Arg He Cys 

245 250 255 

Phe Ser Pro Gin Gin Glu Asn Gly Lys Gin Leu Leu Cys Gly Arg Gly 

260 265 270 

Glu Asn Phe Leu Cys Ala Ser Gly He Cys He Pro Gly Lys Leu Gin 

275 280 285 

CVB Asn Gly Tyr Asn Asp Cys Asp Asp Trp Ser Asp Glu Ala His Cys 

^ 290 295 300 

Asn Cys Ser Glu Asn Leu Phe His Cys His Thr Gly Lys Cys Leu Asn 
305 310 315 320 

Tyr Ser Leu Val Cys Asp Gly Tyr Asp Asp Cys Gly Asp Leu Ser Asp 

' 325 330 335 

Glu Gin Asn Cys Asp Cys Asn Pro Thr Thr Glu His Arg Cys Gly Asp 

340 345 350 

Glv Arg Cys He Ala Met Glu Trp Val Cys Asp Gly Asp His Asp Cys 

355 360 . 365 

Val ASP Lys Ser Asp Glu Val Asn Cys Ser Cys His Ser Gin Gly Leu 

370 375 380 

Val Glu Cys Arg Asn Gly Gin Cys He Pro Ser Thr Phe Gin Cys Asp 
385 390 395 400 

Gly Asp Glu Asp Cys Lys Asp Gly Ser Asp Glu Glu Asn Cys Ser Val 

405 410 415 

He Gin Thr Ser Cys Gin Glu Gly Asp Gin Arg Cys Leu Tyr Asn Pro 

420 425 430 

Cys Leu Asp Ser Cys Gly Gly Ser Ser Leu Cys Asp Pro Asn Asn Ser 

435 440 445 

Leu Asn Asn Cys Ser Gin Cys Glu Pro He Thr Leu Glu Leu Cys Met 

450 455 460 

Asn Leu Pro Tyr Asn Ser Thr Ser Tyr Pro Asn Tyr Phe Gly His Arg 
465 470 475 480 

Thr Gin Lys Glu Ala Ser He Ser Trp Glu Ser Ser Leu Phe Pro Ala 

485 490 495 

Leu Val Gin Thr Asn Cys Tyr Lys Tyr Leu Met Phe Phe Ser Cys Thr 

500 505 510 

He Leu Val Pro Lys Cys Asp Val Asn Thr Gly Glu Arg He Pro Pro 

515 520 525 

Cys Arg Ala Leu Cys Glu His Ser Lys Glu Arg Cys Glu Ser Val Leu 

530 535 540 

Gly He Val Gly Leu Gin Trp Pro Glu Asp Thr Asp Cys Ser Gin Phe 
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545 550 555 560 

Pro Glu Glu Asn Ser Asp Asn Gin Thr Cys Leu Met Pro Asp Glu Tyr 

565 570 575 

Val Glu Glu Cye Ser Pro Ser His Phe Lys Cys Arg Ser Gly Gin Cys 

580 585 590 

Val Leu Ala Ser Arg Arg Cys Asp Gly Gin Ala Asp Cys Asp Asp Asp 

595 600 605 

Ser Asp Glu Glu Asn Cye Gly Cys Lys Glu Arg Asp Leu Trp Glu Cys 

610 615 620 

Pro Ser Asn Lys Gin Cys Leu Lys His Thr Val He Cys Asp Gly Phe 
625 630 635 640 

Pro Asp Cys Pro Asp Tyr Met Asp Glu Lys Asn Cys Ser Phe Cys Gin 

645 650 655 

Asp Asp Glu Leu Glu Cys Ala Asn His Ala Cys Val Ser Arg Asp Leu 

660 665 670 

Trp Cys Asp Gly Glu Ala Asp Cys Ser Asp Ser Ser Asp Glu Trp Asp 

675 680 685 

Cys Val Thr Leu Ser He Asn Val Asn Ser Ser Ser Phe Leu Met Val 

690 695 700 

His Arg Ala Ala Thr Glu His Hie Val Cys Ala Asp Gly Trp Gin Glu 
705 710 715 720 

He Leu Ser Gin Leu Ala Cys Lys Gin Met Gly Leu Gly Glu Pro Ser 

725 730 735 

Val Thr Lys Leu He Gin Glu Gin Glu Lys Glu Pro Arg Trp Leu Thr 

740 745 750 

Leu His Ser Asn Trp Glu Ser Leu Asn Gly Thr Thr Leu His Glu Leu 

755 760 765 

Leu Val Asn Gly Gin Ser Cys Glu Ser Arg Ser Lys He Ser Leu Leu 

770 775 780 

Cys Thr Lys Gin Asp Cys Gly Arg Arg Pro Ala Ala Arg Met Asn Lys 
785 790 795 800 

Arg He Leu Gly Gly Arg Thr Ser Arg Pro Gly Arg Trp Pro Trp Gin 

805 810 815 

Cys Ser Leu Gin Ser Glu Pro Ser Gly His He Cys Gly Cys Val Leu 

' 820 825 830 

He Ala Lys Lys Trp Val Leu Thr Val Ala His Cys Phe Glu Gly Arg 

835 840 845 

Glu Asn Ala Ala Val Trp Lys Val Val Leu Gly He Asn Asn Leu Asp 

850 855 860 

His Pro Ser Val Phe Met Gin Thr Arg Phe Val Lys Thr He He Leu 
865 870 875 880 

His Pro Arg Tyr Ser Arg Ala Val Val Asp Tyr Asp He Ser He Val 

885 890 895 

Glu Leu Ser Glu Asp He Ser Glu Thr Gly Tyr Val Ttrg Pro Val Cys 

900 905 910 

Leu Pro Asn Pro Glu Gin Trp Leu Glu Pro Asp Thr Tyr Cys Tyr He 

915 920 925 

Thr Gly Trp Gly His Met Gly Asn Lys Met Pro Phe Lys Leu Gin Glu 

930 935 940 

Glv Glu Val Arg He He Ser Leu Glu His Cys Gin Ser Tyr Phe Asp 
sis 950 955 960 

Met Lvs Thr He Thr Thr Arg Met He Cys Ala Gly Tyr Glu Ser Gly 

^ 965 970 975 

Thr Val Asp Ser Cys Met Gly Asp Ser Gly Gly Pro Leu Val Cys Glu 

980 985 990 

Lvs Pro Gly Gly Arg Trp Thr Leu Phe Gly Leu Thr Ser Trp Gly Ser 

^ 995 1000 1005 

Val Cys Phe Ser Lys Val Leu Gly Pro Gly Val Tyr Ser Asn Val Ser 

lOlO 1015 1020 

Tvr Phe Val Glu Trp He Lys Arg Gin He Tyr He Gin Thr Phe Leu 
1025 1030 1035 1040 



wo 02/095007 



PCTrtJS02/I6819 



-60- 



Leu Asn 

<210> 30 

<211> 3696 

<212> DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> (41) . . . (3100) 

<223> Nucleic acid encoding human enterokinase 
<300> 

<308> GenBank HSU09860 
<309> 1995-06-03 

<400> 30 

accagacagt tcttaaatta gcaagccttc aaaaccaaaa atg ggg teg aaa aga 55 

Met Gly Ser Lys Arg 
1 5 



ggc ata tct tct agg cat cat tct etc age tec tat gaa ate atg ttt 
Gly lie Ser Ser Arg His His Ser Leu Ser Ser Tyr Glu lie Met Phe 

10 15 20 



103 



gca get etc ttt gcc ata ttg gta gtg etc tgt get gga tta att gca 151 
Ala Ala Leu Phe Ala lie Leu Val Val Leu Cys Ala Gly Leu lie Ala 
25 30 35 



gta tec tgc ctg aca ate aag gaa tec caa cga ggt gca gca ctt gga 
Val Ser Cys Leu Thr He Lys Glu Ser Gin Arg Gly Ala Ala Leu Gly 
40 45 50 



199 



cag agt cat gaa gcc aga gcg aca ttt aaa ata aca tec gga gtt aca 247 
Gin Ser His Glu Ala Arg Ala Thr Phe Lys He Thr Ser Gly Val Thr 
55 60 65 

tat aat cct aat ttg caa gae aaa etc tea gtg gat ttc aaa gtt ctt 
Tvx Asn Pro Asn Leu Gin Asp Lys Leu Ser Val Asp Phe Lys Val Leu 
70 75 80 85 



295 



get ttt gac ctt cag caa atg ata gat gag ate ttt eta tea age aat 343 
Ala Phe Asp Leu Gin Gin Met He Asp Glu He Phe Leu Ser Ser Asn 

90 95 100 

ctg aag aat gaa tat aag aac tea aga gtt tta caa ttt gaa aat ggc 391 
Leu Lys Asn Glu Tyr Lys Asn Ser Arg Val Leu Gin Phe Glu Asn Gly 
105 110 115 



age att ata gtc gta ttt gae ctt ttc ttt gcc cag tgg gtg tea gat 
Ser lie He Val Val Phe T^p Leu Phe Phe Ala Gin Trp Val Ser Asp 
120 125 130 

caa aat gta aaa gaa gaa ctg att caa ggc ctt gaa gca aat aaa tee 
Gin Asn Val Lys Glu Glu Leu He Gin Gly Leu Glu Ala Asn Lys Ser 
135 140 145 



439 



487 



age caa ctg gtc act ttc cat att gat ttg aac age gtt gat ate eta 535 
Ser Gin Leu Val Thr Phe His He Asp Leu Asn Ser Val Asp He Leu 
ISO 155 160 165 
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gac aag eta aca acc acc agt cat ctg gca act cca gga aat gtc tea 5B3 
Asp Lys Leu Thr Thr Thr Ser His Leu Ala Thr Pro Gly Asn Val Ser 

170 175 180 

ata gag tgc ctg cot ggt tea agt cct tgt act gat get eta acg tgt 631 
lie Glu Cys Leu Pro Gly Ser Ser Pro Cys Thr Asp Ala Leu Thr Cys 
185 190 195 

ata aaa get gat tta ttt tgt gat gga gaa gta aac tgt cca gat ggt 679 
lie Lys Ala Asp Leu Phe Cys Asp Gly Glu Val Asn Cys Pro Asp Gly 
200 205 210 

tct gac gaa gac aat aaa atg tgt gcc aca gtt tgt gat gga aga ttt 727 
Ser Asp Glu Asp Asn Lys Met Cys Ala Thr Val Cys Asp Gly Arg Phe 
215 220 225 

ttg tta act gga tea tct ggg tct ttc cag get act cat tat cca aaa 775 
Leu Leu Thr Gly Ser Ser Gly Ser Phe Gin Ala Thr His Tyr Pro Lys 
230 235 240 245 

cct tct gaa aca agt gtt gtc tgc eag tgg ate ata cgt gta aac caa 
Pro Ser Glu Thr Ser Val Val Cys Gin Trp He He Arg Val Asn Gin 

250 255 260 



823 



871 



gga ctt tec att aaa ctg age ttc gat gat ttt aat aca tat tat aca 
Gly Leu Ser He Lys Leu Ser Phe Asp Asp Phe Asn Thr Tyr Tyr Thr 
265 270 275 

gat ata tta gat att tat gaa ggt gta gga tea age aag att tta aga 
ASP He Leu Asp He Tyr Glu Gly Val Gly Ser Ser Lys He Leu Arg 
280 285 290 

act tct att tgg gaa act aat cct ggc aca ata aga att ttt tec aac 
Ala Ser He Trp Glu Thr Asn Pro Gly Thr He Arg He Phe Ser Asn 
295 300 305 

caa gtt act gcc ace ttt ctt ata gaa tct gat gaa agt gat tat gtt 
Gin Val Thr Ala Thr Phe Leu He Glu Ser Asp Glu Ser Asp Tyr Val 
310 315 320 325 

qgc ttt aat gca aca tat act gca ttt aac age agt gag ctt aat aat 
Glv Phe Asn Ala Thr Tyr Thr Ala Phe Asn Ser Ser Glu Leu Asn Asn 
^ 330 335 340 

tat gag aaa att aat tgt aac ttt gag gat ggc ttt tgt ttc tgg gtc 
Tyr Glu Lys He Asn Cys Asn Phe Glu Asp Gly Phe Cys Phe Trp Val 
345 350 355 

cag gat eta aat gat gat aat gaa tgg gaa agg att cag gga age acc 1159 
Gin Asp Leu Asn Asp TUp Asn Glu Trp Glu Arg He Gin Gly Ser Thr 
360 365 370 

ttt tct cct ttt act gga ccc aat ttt gac cac act ttt ggc aat get 1207 
Phe Ser Pro Phe Thr Gly Pro Asn Phe Asp His Thr Phe Gly Asn Ala 
375 380 385 

tea gga ttt tac att tct acc cca act gga cca gga ggg aga caa gaa 1255 
Ser Gly Phe Tyr He Ser Thr Pro Thr Gly Pro Gly Gly Arg Gin Glu 
390 395 400 405 



919 



967 



1015 



1063 



1111 
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cga gtg ggg ctt tta age etc cct ttg gac ccc act ttg gag cca get 1303 
Arg Val Gly Leu Leu Ser Leu Pro Leu Asp Pro Thr Leu Glu Pro Ala 

410 415 420 

tgc ett agt ttc tgg tat cat atg tat ggt gaa aat gtc cat aaa tta 1351 
Cys Leu Ser Phe Trp Tyr His Met Tyr Gly Glu Asn Val His LyB Leu 
425 430 435 

age att aat ate age aat gac caa aat atg gag aag aca gtt ttc caa 1399 
Ser He Asn He Ser Asn Asp Gin Asn Met Glu Lys Thr Val Phe Gin 
440 445 450 

aag gaa gga aat tat gga gac aat tgg aat tat gga caa gta ace eta 1447 
Lys Glu Gly Asn Tyr Gly Asp Asn Trp Asn Tyr Gly Gin Val Thr Leu 
455 460 465 

aat gaa aca gtt aaa ttt aag gtt get ttt aat get ttt aaa aac aag 1495 
Asn Glu Thr Val Lys Phe Lys Val Ala Phe Asn Ala Phe Lys Asn Lys 
470 475 480 485 

ate ctg agt gat att gcg ttg gat gac att age eta aca tat ggg att 1543 
He Leu Ser Asp He Ala Leu Asp Asp He Ser Leu Thr Tyr Gly He 

490 495 500 

tgc aat ggg agt ctt tat cca gaa cca act ttg gtg cca act cet cca 1591 
Cys Asn Gly Ser Leu Tyr Pro Glu Pro Thr Leu Val Pro Thr Pro Pro 
505 510 515 

cca gaa ctt cct acg gac tgt gga gga cct ttt gag ctg tgg gag cca 1639 
Pro Glu Leu Pro Thr Asp Cys Gly Gly Pro Phe Glu Leu Trp Glu Pro 



gtg ctt etc ate act aac gat gtg ttg gca aga gga ggg ttt aaa gca 

Val Leu Leu He. "Thx Asn Asp Val Leu Ala Arg Gly Gly Phe Lys Ala 

615 620 625 

aac ttt act act ggc tat cae ttg ggg att cca gag cca tgc aag gca 

Asn Phe Thr Thr Gly Tyr His Leu Gly He Pro Glu Pro Cys Lys Ala 
630 635 640 645 

gac eat ttt caa tgt aaa aat gga gag tgt gtt cca ctg gtg aat etc 



1687 



520 525 530 

aat aca aca ttc agt tct acg aac ttt cca aac age tac cet aat ctg 

Asn Thr Thr Phe Ser Ser Thr Asn Phe Pro Asn Ser Tyr Pro Asn Leu 
535 540 545 

get ttc tgt gtt tgg att tta aat gca caa aaa gga aag aat ata caa 1735 

Ala Phe Cys Val Trp He Leu Asn Ala Gin Lys Gly Lys Asn He Gin 

550 555 560 565 

Ctt cat ttt caa gaa ttt gae tta gaa aat att aac gat gta gtt gaa 1783 

Leu His Phe Gin Glu Phe Asp Leu Glu Asn He Asn Asp Val Val Glu 

570 575 580 

ata aga gat ggt gaa gaa get gat tec ttg etc tta get gtg tac aca 1831 

He Arg Asp Gly Glu Glu Ala Asp Ser Leu Leu Leu Ala Val Tyr Thr 
585 590 595 

ggg cct ggc cca gta aag gat gtg ttc tct aec acc aac aga atg act 1879 

Gly Pro Gly Pro Val Lys Asp Val Phe Ser Thr Thr Asn Arg Met Thr 
600 605 610 



1927 



1975 



2023 
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Asp His Phe Gin Cys Lys Asn Gly Glu Cys Val Pro Leu Val Asn Leu 

650 655 660 

tgt gac ggt cat ctg cac tgt gag gat ggc tea gat gaa gca gat tgt 2071 
Cys Asp Gly His Leu His Cys Glu Asp Gly Ser Asp Glu Ala Asp Cys 
665 670 675 

gtg cgt ttt ttc aat ggc aca acg aac aac aat ggt tta gtg egg ttc 2119 
Val Airg Phe Phe Asn Gly Thr Thr Asn Asn Asn Gly Leu Val Arg Phe 
680 685 690 

aga ate cag age ata tgg cat aca get tgt get gag aac tgg acc ace 2167 
Arg He Gin Ser He Trp His Thr Ala Cys Ala Glu Asn Trp Thr Thr 
695 700 705 



cag att tea aat gat gtt tgt caa ctg ctg gga eta ggg agt gga aac 
Gin He Ser Asn Asp Val Cys Gin Leu Leu Gly Leu Gly Ser Gly Asn 
710 715 720 725 

tea tea aag cca ate ttc tct acc gat ggt gga cca ttt gtc aaa tta 
Ser Ser Lys Pro He Phe Ser Thr Asp Gly Gly Pro Phe Val Lys Leu 

730 735 740 



aca gca ate eta ggc ctg cat atg aaa tea aat ctg acc tct cct caa 

Thr Ala He Leu Gly Leu His Met Lys Ser Asn Leu Thr Ser Pro Gin 
840 845 850 

aca gtc cct ega tta ata gat gaa att gtc ata aac cct cat tac aat 

Thr Val Pro Arg Leu He Asp Glu He Val He Asn Pro Hie Tyr Asn 
855 860 865 

agg cga aga aag gac aac gac att gee atg atg cat ctg gaa ttt aaa 

Arg Arg Arg Lys Asp Asn Asp He Ala Met Met His Leu Glu Phe Lys 

870 875 880 885 

gtg aat tac aca gat tac ata caa cct att tgt tta ccg gaa gaa aat 

Val Asn Tyr Thr Asp Tyr He Gin Pro He Cys Leu Pro Glu Glu Asn 



2215 



2263 



aac aca gca cct gat ggc cac tta ata eta aca ccc agt caa cag tgt 2311 
Asn Thr Ala Pro Asp Gly His Leu He Leu Thr Pro Ser Gin Gin Cys 
745 750 755 

tta cag gat tec ttg att egg tta cag tgt aac cat aaa tct tgt gga 2359 
Leu Gin Asp Ser Leu He Arg Leu Gin Cys Asn His Lys Ser Cys Gly 
760 765 770 

aaa aaa ctg gca get caa gac ate acc cca aag att gtt gga gga agt 
Lys Lys Leu Ala Ala Gin Asp He Thr Pro Lys He Val Gly Gly Ser 
775 780 785 



2407 



aat gee aaa gaa ggg gcc tgg ccc tgg gtt gtg ggt ctg tat tat ggc 2455 
Asn Ala Lys Glu Gly Ala Trp Pro Trp Val Val Gly Leu Tyr Tyr Gly 
790 795 800 805 

ggc cga ctg etc tgc ggc gca tct etc gtc age agt gac tgg ctg gtg 2503 
Gly Arg Leu Leu Cys Gly Ala Ser Leu Val Ser Ser Asp Trp Leu Val 

810 815 820 

tec gee gca cac tgc gtg tat ggg aga aac tta gag cca tec aag tgg 2551 
Ser Ala Ala His Cys Val Tyr Gly Arg Asn Leu Glu Pro Ser Lys Trp 
825 830 835 



2599 



2647 



2695 



2743 
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890 



895 



900 



caa gtt ttt cct cca gga aga aat tgt tct att get ggt tgg ggg acg 
Gin Val Phe Pro Pro Gly Arg Asn Cys Ser He Ala Gly Trp Gly Thr 
905 910 915 

gtt gta tat caa ggt act act gca aac ata ttg caa gaa get gat gtt 
Val Val Tyr Gin Gly Thr Thr Ala Asn He Leu Gin Glu Ala Asp Val 
920 925 930 

cct ctt eta tea aat gag aga tgc caa cag cag atg cca gaa tat aac 
Pro Leu Leu Ser Asn Glu Arg Cys Gin Gin Gin Met Pro Glu Tyr Asn 
935 940 945 

att act gaa aat atg ata tgt gca ggc tat gaa gaa gga gga ata gat 
He Thr Glu Asn Met He Cys Ala Gly Tyr Glu Glu Gly Gly He Asp 
950 955 9S0 965 

tct tgt cag ggg gat tea gga gga cca tta atg tgc caa gaa aac aac 
Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Met Cys Gin Glu Asn Asn 

970 975 980 

agg tgg ttc ctt get ggt gtg acc tea ttt gga tac aag tgt gee ctg 
Arg Trp Phe Leu Ala Gly Val Thr Ser Phe Gly Tyr Lys Cys Ala Leu 
985 990 995 

cct aat cgc ccc gga gtg tat gcc agg gte tea agg ttt acc gaa tgg 
Pro Asn Arg Pro Gly Val Tyr Ala Arg Val Ser Axg Phe Thr Glu Trp 
1000 1005 1010 

ata caa agt ttt eta cat tag cgcatttett aaactaaaca ggaaagtcgc 
He Gin Ser Phe Leu His * 
1015 



attattttcc 
aaaagttacc 
acaaaatttt 
taaataeatt 
cttaatgatt 
catacactta 
aagtatgtca 
ctattagcag 
aacettagta 
gaetaaattg 



cattctaete 
aaaggttttt 
aaaaataata 
tgtatttcat 
atttttatta 
agaaatttga 
ctgttggaaa 
aaactcaatt 
ttttcccaac 
attttacgtt 



tagaaagcat 
attettacct 
aaattcacca 
tgtgaaeagg 
cttaetgttg 
gcagaattta 
taaactgeca 
gtttctctgt 
eaatagaaac 
cctctt 



ggaaattaag 
atgtcaatga 
tagcaataca 
tatttcttca 
tttaaaggga 
aaaaagaaag 
taaattttct 
cttttetatc 
tatttattgt 



tgtttcgtac 
aatgctaggg 
gaataaettt 
c agate teat 
tgttatttta 
aaaataaatt 
agttceagtt 
aaaattttea 
aagcttatgt 



aaaaatttta 
ggccagggaa 
aaaataccat 
ttttaaaatt 
aagcatatac 
gtttttccca 
tagtttgctg 
acatatgcat 
caeaggcetg 



2791 



2839 



2B87 



2935 



2983 



3031 



3079 



3130 



3190 
3250 
3310 
3370 
3430 
3490 
3550 
3610 
3670 
3696 



<210> 31 

<211> 1019 

<212> PRT 

<213> Homo Sapien 

<400> 31 

Met Gly Ser Lys Arg Gly He Ser Ser Arg His Hxs Ser Leu Ser Ser 

1 5 10 15 

Tyr Glu He Met Phe Ala Ala Leu Phe Ala He Leu Val Val Leu Cys 

20 25 30 

Ala Gly Leu He Ala Val Ser Cys Leu Thr He Lys Glu Ser Gin Arg 

35 40 45 

Gly Ala Ala Leu Gly Gin Ser His Glu Ala Arg Ala Thr Phe Lys He 

50 55 60 

Thr Ser Gly Val Thr Tyr Asn Pro Asn Leu Gin Asp Lyg Leu Ser Val 
65 70 75 80 
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Asp Phe Lys Val Leu Ala Phe Asp Leu Gin Gin Met lie Asp Glu lie 

85 90 95 

Phe Leu Ser Ser Asn Leu Lys Asn Glu Tyr Lys Aan Ser Arg Val Leu 

100 105 110 

Gin Phe Glu Asn Gly Ser lie He Val Val Phe Asp Leu Phe Phe Ala 

H5 120 125 

Gin Trp Val Ser Asp Gin Asn Val Lys Glu Glu Leu He Gin Gly Leu 

130 135 140 

Glu Ala Asn Lys Ser Ser Gin Leu Val Thr Phe His He Asp Leu Asn 
145 ISO 155 160 

Ser Val Asp He Leu Asp Lys Leu Thr Thr Thr Ser His Leu Ala Thr 

165 170 175 

Pro Gly Asn Val Ser He Glu Cys Leu Pro Gly Ser Ser Pro Cys Thr 

180 185 190 

Asp Ala Leu Thr Cys He Lys Ala Asp Leu Phe Cys Asp Gly Glu Val 

195 200 205 

Asn Cys Pro Asp Gly Ser Asp Glu Asp Asn Lys Met Cys Ala Thr Val 

210 215 220 

Cys Asp Gly Arg Phe Leu Leu Thr Gly Ser Ser Gly Ser Phe Gin Ala 
225 230 235 240 

Thr His Tyr Pro Lys Pro Ser Glu Thr Ser Val Val Cys Gin Trp He 

245 250 255 

He Arq Val Asn Gin Gly Leu Ser He Lys Leu Ser Phe Asp Asp Phe 

260 265 270 

Asn Thr Tyr Tyr Thr Asp He Leu Asp He Tyr Glu Gly Val Gly Ser 

275 280 285 

Ser Lys He Leu Arg Ala Ser He Trp Glu Thr Asn Pro Gly Thr He 

290 295 300 

Arg He Phe Ser Asn Gin Val Thr Ala Thr Phe Leu He Glu Ser Asp 
305 310 315 320 

Glu Ser Asp Tyr Val Gly Phe Asn Ala Thr Tyr Thr Ala Phe Asn Ser 

325 330 335 

Ser Glu Leu Asn Asn Tyr Glu Lys He Asn Cys Asn Phe Glu Asp Gly 

340 345 350 

Phe Cys Phe Trp Val Gin Asp Leu Asn Asp Asp Asn Glu Trp Glu Arg 

355 360 365 

He Gin Gly Ser Thr Phe Ser Pro Phe Thr Gly Pro Asn Phe Asp His 

370 375 380 

Thr Phe Gly Asn Ala Ser Gly Phe Tyr He Ser Thr Pro Thr Gly Pro 
385 390 395 400 

Gly Gly Arg Gin Glu Arg Val Gly Leu Leu Ser Leu Pro Leu Asp Pro 

405 410 415 

Thr Leu Glu Pro Ala Cys Leu Ser Phe Trp Tyr His Met Tyr Gly Glu 

420 425 430 

Asn Val His Lys Leu Ser He Asn He Ser Asn Asp Gin Asn Met Glu 

435 440 445 

Lys Thr Val Phe Gin Lys Glu Gly Asn Tyr Gly Asp Aen Trp Asn Tyr 

450 455 460 

Glv Gin Val Thr Leu Asn Glu Thr Val Lys Phe Lys Val Ala Phe Asn 
4S5 470 475 480 

Ala Phe Lys Asn Lys He Leu Ser Asp He Ala Leu Asp Asp He Ser 

485 / 490 495 

Leu Thr Tyr Gly He Cys Asn Gly Ser Leu Tyr Pro Glu Pro Thr Leu 

500 505 510 

Val Pro Thr Pro Pro Pro Glu Leu Pro Thr Asp Cys Gly Gly Pro Phe 

515 520 525 

Glu Leu Trp Glu Pro Asn Thr Thr Phe Ser Ser Thr Asn Phe Pro Asn 

530 535 540 

Ser Tyr Pro Asn Leu Ala Phe Cys Val Trp He Leu Asn Ala Gin Lys 
545 550 555 560 

Gly Lys Asn He Gin Leu His Phe Gin Glu Phe Asp Leu Glu Asn He 
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ses 570 575 

Asn Asp Val Val Glu lie Arg Asp Gly Glu Glu Ala Asp Ser Leu Leu 

580 585 590 

Leu Ala Val Tyr Thr Gly Pro Gly Pro Val Lys Asp Val Phe Ser Thr 

595 600 605 

Thr Asn Arg Met Thr Val Leu Leu lie Thr Asn Asp Val Leu Ala Arg 

610 615 620 

Gly Gly Phe Lys Ala Asn Phe Thr Thr Gly Tyr His Leu Gly lie Pro 
625 630 635 640 

Glu Pro Cys Lys Ala Asp His Phe Gin Cys Lys Asn Gly Glu Cys Val 

645 650 655 

Pro Leu Val Asn Leu Cys Asp Gly His Leu His Cys Glu Asp Gly Ser 

660 665 670 

Asp Glu Ala Asp Cys Val Arg Phe Phe Asn Gly Thr Thr Asn Asn Asn 

675 680 685 

Gly Leu Val Arg Phe Arg lie Gin Ser lie Trp His Thr Ala Cys Ala 

690 695 700 

Glu Asn Trp Thr Thr Gin lie Ser Asn Asp Val Cys Gin Leu Leu Gly 
705 710 715 720 

Leu Gly Ser Gly Asn Ser Ser Lys Pro lie Phe Ser Thr Asp Gly Gly 

725 730 735 

Pro Phe Val Lys Leu Asn Thr Ala Pro Asp Gly His Leu lie Leu Thr 

740 745 750 

Pro Ser Gin Gin Cys Leu Gin Asp Ser Leu lie Arg Leu Gin Cys Asn 

755 760 765 

His Lys Ser Cys Gly Lys Lys Leu Ala Ala Gin Asp He Thr Pro Lys 

770 775 780 

He Val Gly Gly Ser Asn Ala Lys Glu Gly Ala Trp Pro Trp Val Val 
785 790 795 800 

Gly Leu Tyr Tyr Gly Gly Arg Leu Leu Cys Gly Ala Ser Leu Val Ser 

805 810 815 

Ser Asp Trp Leu Val Ser Ala Ala His Cys Val Tyr Gly Arg Asn Leu 

820 825 830 

Glu Pro Ser Lys Trp Thr Ala He Leu Gly Leu His Met Lys Ser Asn 

835 840 845 

Leu Thr Ser Pro Gin Thr Val Pro Arg Leu He Asp Glu He Val He 

850 855 860 

Asn Pro His Tyr Asn Arg Arg Arg Lys Asp Asn Asp He Ala Met Met 
865 870 875 880 

His Leu Glu Phe Lys Val Asn Tyr Thr Asp Tyr He Gin Pro He Cys 

885 890 895 

Leu Pro Glu Glu Asn Gin Val Phe Pro Pro Gly Arg Aen Cys Ser He 

900 905 910 

Ala Gly Trp Gly Thr Val Val Tyr Gin Gly Thr Thr Ala Asn He Leu 

915 920 925 

Gin Glu Ala Asp Val Pro Leu Leu Ser Asn Glu Arg Cys Gin Gin Gin 

930 935 940 

Met Pro Glu Tyr Asn He Thr Glu Asn Met He Cys Ala Gly Tyr Glu 
945 950 955 960 

Glu Gly Gly He Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Met 

965 970 975 

Cvs Gin Glu Asn Asn Arg Trp Phe Leu Ala Gly Val Thr Ser Phe Gly 

980 985 990 

TVr Lys Cys Ala Leu Pro Asn Arg Pro Gly Val Tyr Ala Arg Val Ser 

995 1000 1005 

Arg Phe Thr Glu Trp He Gin Ser Phe Leu His 
1010 1015 

<210> 32 
<211> 1500 
<212> DNA 
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<213> Homo Sapien 

<220> 
<221> CDS 

<222> (62) . . . (1318) 

<223> Nucleotide sequence encoding human airway 
trypsin- like protease 

<300> 

<308> GenBank AB002134 
<309> 1998-06-04 

<400> 32 

gagtgggaat ctcaaagcag ttgagtaggc agaaaaaaga acctcttcat taaggattaa 
a atg tat agg cca gca cgt gta act teg act tea aga ttt ctg aat oca 
Met Tyr Arg Pro Ala Arg Val Thr Ser Thr Ser Arg Phe Leu Asn Pro 
15 10 15 



60 
109 



tat gta gta tgt ttc att gtc gtc gca ggg gta gtg ate ctg gca gtc 157 
Tyr val Val Cys Phe lie Val Val Ala Gly Val Val lie Leu Ala Val 
20 25 30 

acc ata get eta ctt gtt tac ttt tta get ttt gat caa aaa tct tac 
Thr lie Ala Leu Leu Val Tyr Phe Leu Ala Phe Asp Gin Lys Ser Tyr 
35 40 45 



205 



301 



349 



ttt tat agg age agt ttt caa etc eta aat gtt gaa tat aat agt cag 253 
Phe Tyr Arg Ser Ser Phe Gin Leu Leu Asn Val Glu Tyr Asn Ser Gin 
50 55 60 

tta aat tea cca get aca cag gaa tac agg act ttg agt gga aga att 
Leu Asn Ser Pro Ala Thr Gin Glu Tyr Arg Thr Leu Ser Gly Arg lie 
65 70 75 80 

gaa tct ctg att act aaa aca ttc aaa gaa tea aat tta aga aat cag 
Glu Ser Leu He Thr Lys Thr Phe Lys Glu Ser Asn Leu Arg Asn Gin 

85 90 95 

ttc ate aga get cat gtt gcc aaa ctg agg caa gat ggt agt ggt gtg 397 
Phe He Arg Ala His Val Ala Lys Leu Arg Gin Asp Gly Ser Gly Val 
100 105 110 

aga gcg gat gtt gtc atg aaa ttt caa ttc act aga aat aac aat gga 445 
Arq Ala Asp Val Val Met Lys Phe Gin Phe Thr Arg Asn Asn Asn Gly 
115 120 125 

gca tea atg aaa age aga att gag tct gtt tta ega caa atg ctg aat 493 
Ala Ser Met Lys Ser Arg He Glu Ser Val Leu Arg Gin Met Leu Asn 
130 135 140 

aac tct gga aac ctg gaa ata aac ect tea act gag ata aca tea ctt 541 
Asn Ser Gly Asn Leu Glu He Asn Pro Ser Thr Glu He Thr Ser Leu 
145 150 155 160 

act gac cag get gca gca aat tgg ctt att aat gaa tgt ggg gcc ggt 589 
Thr Asp Gin Ala Ala Ala Asn Trp Leu He Asn Glu Cys Gly Ala Gly 

165 170 175 

cca gac eta ata aca ttg tct gag cag aga ate ett gga gge act gag 637 
Pro Asp Leu He Thr Leu Ser Glu Gin Arg He Leu Gly Gly Thr Glu 
180 185 190 
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gct gag gag gga age tgg ccg tgg caa gtc agt ctg egg etc aat aat 
Ala Glu Glu Gly Ser Trp Pro Trp Gin Val Ser Leu Arg Leu Asn Asn 
195 200 205 



585 



gcc cac cac tgt gga ggc age ctg ate aat aac atg tgg ate ctg aca 73 3 

Ala Hi a His Cys Gly Gly Ser Leu He Asn Asn Met Trp He Leu Thr 
210 215 220 

gca get cac tgc ttc aga age aac tct aat cct cgt gac tgg att gcc 
Ala Ala Hie Cys Phe Arg Ser Aen Ser Asn Pro Arg Asp Trp He Ala 
225 230 235 240 



781 



925 



973 



acg tct ggt att tee aca aca ttt cct aaa eta aga atg aga gta aga 829 
Thr Ser Gly He Ser Thr Thr Phe Pro Lys Leu Arg Met Arg Val Arg 

245 250 255 

aat att tta att cat aac aat tat aaa tct gca act cat gaa aat gac 877 
Asn He Leu He His Asn Asn Tyr Lys Ser Ala Thr His Glu Asn Asp 
260 265 270 

att gca ctt gtg aga ctt gag aac agt gtc acc ttt ace aaa gat ate 
He Ala Leu Val Arg Leu Glu Asn Ser Val Thr Phe Thr Lys Asp He 
275 280 285 

cat agt gtg tgt etc cca get get acc cag aat att cca cct ggc tct 
His ser Val Cys Leu Pro Ala Ala Thr Gin Asn He Pro Pro Gly Ser 
290 295 300 

act get tat gta aca gga tgg ggc get caa gaa tat get ggc cac aca 1021 
Thr Ala Tyr Val Thr Gly Trp Gly Ala Gin Glu Tyr Ala Gly His Thr 
305 310 315 320 

Qtt cca gag eta agg caa gga cag gtc aga ata ata agt aat gat gta 1069 
val Pro Glu Leu Arg Gin Gly Gin Val Arg He He Ser Asn Asp Val 

325 330 335 

tgt aat gca cca cat agt tat aat gga gcc ate ttg tct gga atg ctg 1117 
Cvs Asn Ala Pro His Ser Tyr Asn Gly Ala He Leu Ser Gly Met Leu 
340 345 350 

tgt get gga gta cct caa ggt gga gtg gac gca tgt cag ggt gac tct 1165 
Cys Ala Gly Val Pro Gin Gly Gly Val Asp Ala Cys Gin Gly Asp Ser 
355 360 365 

ggt ggc cca eta gta caa gaa gac tea egg egg ctt tgg ttt att gtg 1213 
Gly Gly Pro Leu Val Gin Glu Asp Ser Arg Arg Leu Trp Phe He Val 
370 375 380 

ggg ata gta age tgg gga gat cag tgt ggc ctg ccg gat aag cca gga 1261 
Gly He Val Ser Trp Gly Asp Gin Cys Gly Leu Pro Asp Lys Pro Gly 
385 390 395 400 

gtg tat act cga gtg aca gcc tac ctt gac tgg att agg caa caa act 13 09 

Val Tyr Thr Arg Val Thr Ala Tyr Leu Asp Trp He Arg Gin Gin Thr 

405 410 415 

ggg ate tag tgcaacaagt gcatccctgt tgcaaagtct gtatgcaggt 1358 
Gly He * 
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gtgcctgtct taaattccaa agctttacat ttcaactgaa aaagaaacta gaaatgtcct 1418 
aatttaacat cttgttacat aaatatggtt taacaaacac tgtttaacct ttctttatta 1478 
ttaaaggttt tctattttct cc 1500 

<210> 33 

<211> 418 

<212> PRT 

<213> Homo Sapien 

<400> 33 

Met Tyr Arg Pro Ala Arg Val Thr Ser Thr Ser Arg Phe Leu Asn Pro 

15 10 15 

Tyr Val Val Cys Phe He Val Val Ala Gly Val Val He Leu Ala Val 

20 25 30 

Thr He Ala Leu Leu Val Tyr Phe Leu Ala Phe Asp Gin Lys Ser Tyr 

35 40 45 

Phe Tyr Arg Ser Ser Phe Gin Leu Leu Aen Val Glu Tyr Asn Ser Gin 

SO 55 60 

Leu Asn Ser Pro Ala Thr Gin Glu Tyr Arg Thr Leu Ser Gly T^g He 
65 70 75 80 

Glu Ser Leu He Thr Lys Thr Phe Lys Glu Ser Asn Leu Arg Asn Gin 

85 90 95 

Phe He Arg Ala Hie Val Ala Lys Leu Arg Gin Asp Gly Ser Gly Val 

100 105 110 

Arg Ala Asp Val Val Met Lys Phe Gin Phe Thr Arg Asn Asn Asn Gly 

115 120 125 

Ala Ser Met Lys Ser Arg He Glu Ser Val Leu Arg Gin Met Leu Asn 

130 135 140 

Asn Ser Gly Asn Leu Glu He Asn Pro Ser Thr Glu He Thr Ser Leu 
145 150 155 160 

Thr Asp Gin Ala Ala Ala Asn Trp Leu He Asn Glu Cys Gly Ala Gly 

165 170 175 

Pro Asp Leu He Thr Leu Ser Glu Gin Arg He Leu Gly Gly Thr Glu 

180 185 190 

Ala Glu Glu Gly Ser Trp Pro Trp Gin Val Ser Leu Arg Leu T^n Asn 

195 200 205 

Ala His His Cys Gly Gly Ser Leu He Asn Asn Met Trp He Leu Thr 

210 215 ' 220 

Ala Ala His Cys Phe Arg Ser Asn Ser Asn Pro Arg Asp Trp He Ala 
225 230 235 240 

Thr Ser Gly He Ser Thr Thr Phe Pro Lys Leu Arg Met Arg Val Arg 

245 250 255 

Asn He Leu He His Asn Asn Tyr Lys Ser Ala Thr His Glu Asn Asp 

260 265 270 

He Ala lieu Val Arg Leu Glu Asn Ser Val Thr Phe Thr Lys Asp He 

275 280 285 

His Ser Val Cys Leu Pro Ala Ala Thr Gin Asn He Pro Pro Gly Ser 

290 295 300 

Thr Ala Tyr Val Thr Gly Trp Gly Ala Gin Glu Tyr Ala Gly His Thr 
305 310 315 320 

Val Pro Glu Leu Arg Gin Gly Gin Val Arg He He Ser Asn Asp Val 

325 330 335 

Cys Asn Ala Pro His Ser Tyr Asn Gly Ala He Leu Ser Gly Met Leu 

340 345 350 

Cys Ala Gly Val Pro Gin Gly Gly Val Asp Ala Cys Gin Gly Asp Ser 

355 360 365 

Gly Gly Pro Leu Val Gin Glu Asp Ser Arg T^rg Leu Trp Phe He Val 

370 375 380 

Gly He Val Ser Trp Gly Asp Gin Cys Gly Leu Pro Asp Lys Pro Gly 
385 390 395 400 

Val Tyr Thr Arg Val Thr Ala Tyr Leu Asp Trp He Arg Gin Gin Thr 
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405 410 415 

Gly lie 

<210> 34 

<211> 1783 

<212> DNA 

<213> Homo Sapien 

<220> 

<221> CDS 

<222> (246) . . . (1499) 

<223> Nucleic acid encoding human hepsin 
<300> 

<308> GenBank M18930 

<309> 1993-06-11 

<400> 34 

tcgagcccgc tttccaggga ccctacctga gggcccacag gtgaggcagc ctggcctagc 60 

aggccccacg ccaccgcctc tgcctccagg ccgcccgctg ctgcggggcc accatgctcc 120 

tgcccaggcc tggagactga cccgaccccg gcactacctc gaggctccgc ccccacctgc 180 

tggaccccag ggtcccaccc tggcccagga ggtcagccag ggaatcatta acaagaggca 240 

gtgac atg gcg cag aag gag ggt ggc egg act gtg cca tgc tgc tec aga 290 

Met Ala Gin Lys Glu Gly Gly Arg Thr Val Pro Cys Cys Ser Arg 
15 10 15 

ccc aag gtg gca get etc act gcg ggg acc ctg eta ctt ctg aca gcc 33 8 

Pro Lys Val Ala Ala Leu Thr Ala Gly Thr Leu Leu Leu Leu Thr Ala 

20 25 30 

ate ggg gcg gca tec tgg gee att gtg get gtt etc etc agg agt gac 
lie Gly Ala Ala Ser Trp Ala lie Val Ala Val Leu Leu Arg Ser Asp 
35 40 45 

cag gag ceg ctg tac cca gtg cag gtc age tct gcg gac get egg etc 
Gin Glu Pro Leu Tyr Pro Val Gin Val Ser Ser Ala Asp Ala Arg Leu 
50 55 60 

atg gtc ttt gac aag acg gaa ggg acg tgg egg ctg ctg tgc tee teg 
Met Val Phe Asp Lys Thr Glu Gly Thr Trp Arg Leu Leu Cys Ser Ser 
65 70 75 

cgc tec aac gee agg gta gcc gga etc age tgc gag gag atg ggc tte 
Arg Ser Asn Ala Arg Val Ala Gly Leu Ser Cys Glu Glu Met Gly Phe 
80 85 90 95 

etc agg gca ctg acc cac tec gag ctg gae gtg cga acg gcg ggc gcc 
Leu Arg Ala Leu Thr His Ser Glu Leu Asp Val Arg Thr Ala Gly Ala 

100 105 110 

aat ggc acg teg ggc tte tte tgt gtg gac gag ggg agg ctg ccc cae 
Asn Gly Thr Ser Gly Phe Phe Cys Val Asp Glu Gly Arg Leu Pro His 
115 120 125 

acc cag agg ctg ctg gag gtc ate tec gtg tgt gat tgc ccc aga ggc 674 
Thr Gin Arg Leu Leu Glu Val lie Ser Val Cys Asp Cys Pro Arg Gly 
130 135 140 



386 



434 



482 



530 



578 



626 



cgt tte ttg gee gee ate tgc eaa gae tgt ggc cgc agg aag ctg ccc 
Arg Phe Leu Ala Ala He Cys Gin Asp Cys Gly Arg Arg Lys Leu Pro 
145 150 155 



722 



r 
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gtg gac cgc ate gtg gga ggc egg gac acc age ttg ggc egg tgg ecg 770 
Val Asp Arg lie Val Gly Gly Arg Asp Thr Ser Leu Gly Arg Trp Pro 
160 165 170 175 

tgg caa gtc age ett cge tat gat gga gca cac etc tgt ggg gga tec 818 
Trp Gin Val Ser Leu Arg Tyr Asp Gly Ala His Leu Cys Gly Gly Ser 

180 185 190 

ctg etc tec ggg gac tgg gtg ctg aca gee gee eae tge ttc eeg gag 866 
Leu Leu Ser Gly Asp Trp Val Leu Thr Ala Ala His Cys Phe Pro Glu 
195 200 205 

egg aac egg gte etg tec cga tgg ega gtg ttt gee ggt gcc gtg gcc 914 
Arg Asn Arg Val Leu Ser Arg Trp Arg Val Phe Ala Gly Ala Val Ala 
210 215 220 

eag gcc tct cce cac ggt ctg cag ctg ggg gtg cag get gtg gtc tac 962 
Gin Ala Ser Pro His Gly Leu Gin Leu Gly Val Gin Ala Val Val Tyr 
225 230 235 

cac ggg ggc tat ctt cce ttt egg gac cce aac age gag gag aac age lOlO 
His Gly Gly Tyr Leu Pro Phe Arg Asp Pro Asn Ser Glu Glu Asn Ser 
240 245 250 255 



aac gat att gcc ctg gte eae etc tec agt cce ctg cce etc aca gaa 
Asn Asp lie Ala Leu Val His Leu Ser Ser Pro Leu Pro Leu Thr Glu 

260 265 270 



gac age ggt ggt cce ttt gtg tgt gag gac age ate tct egg aeg cca 
Asp Ser Gly Gly Pro Phe Val Cys Glu Asp Ser lie Ser Arg Thr Pro 
355 360 365 



1058 



tae ate cag cct gtg tgc etc cca get gcc ggc cag gee ctg gtg gat 1106 
Tyr lie Gin Pro Val Cys Leu Pro Ala Ala Gly Gin Ala Leu Val Asp 
275 280 285 

ggc aag ate tgt ace gtg aeg ggc tgg ggc aac acg cag tac tat ggc 1154 
Gly Lys lie Cys Thr Val Thr Gly Trp Gly Asn Thr Gin Tyr Tyr Gly 
290 295 300 

caa cag gee ggg gta etc eag gag get cga gte cec ata ate age aat 1202 
Gin Gin Ala Gly Val Leu Gin Glu Ala Arg Val Pro lie lie Ser Asn 
305 310 315 

gat gte tgc aat gge get gac ttc tat gga aac eag ate aag cce aag 1250 
Asp Val Cys Asn Gly Ala Asp Phe Tyr Gly Asn Gin lie Lys Pro Lys 
320 325 330 335 

atg ttc tgt get ggc tac cce gag ggt ggc att gat gee tge cag ggc 1298 
Met Phe Cys Ala Gly Tyr Pro Glu Gly Gly lie Asp Ala Cys Gin Gly 

340 345 350 



1346 



cgt tgg egg ctg tgt ggc att gtg agt tgg ggc act gge tgt gee etg 1394 
Arg Trp Arg Leu Cys Gly lie Val Ser Trp Gly Thr Gly Cys Ala Leu 
370 375 380 

gcc cag aag cca ggc gtc tae acc aaa gtc agt gac ttc egg gag tgg ^ 1442 
Ala Gin Lys Pro Gly Val Tyr Thr Lys Val Ser Asp Phe Arg Glu Trp 
385 390 395 
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atc ttc cag gcc ata aag act cac tec gaa gcc age ggc atg gtg acc 1490 
He Phe Gin Ala He Lys Thr His Ser Glu Ala Ser Gly Met Val Thr 
400 405 410 415 

cag etc tga ccggtggctt ctcgctgcgc agcctccagg gcccgaggtg 153 9 

Gin Leu * 



atcccggtgg tgggatccac gctgggccga ggatgggacg tttttcttct tgggcccggt 1599 

ccacaggtcc aaggacaccc tccctccagg gtcctctctt ecacagtgge gggcccactc 1659 

agccccgaga ccacccaacc tcaccctcct gacccccatg taaatattgt tctgctgtct 1719 

gggactcctg tctaggtgcc cctgatgatg ggatgctctt taaataataa agatggtttt 1779 

gatt 1783 

<210> 35 

<211> 417 

<212> PRT 

<213> Homo Sapien 



<400> 35 



Met Ala Gin 


Lys 


Glu 


Gly 


Gly 


Arg 


Thr 


Val 


Pro Cys 


Cys 


Ser 


Arg 


Pro 


1 


5 






10 








15 




Lye Val Ala 


Ala 


Leu 


Thr 


Ala 


Gly 


Thr 


Leu 


Leu Leu 


Leu 


Thr 


Ala 


He 


20 










2b 














Gly Ala Ala 


Ser 


Trp 


Ala 


He 


Val 


Ala 


Val 


Leu Leu 


Arg 


Ser 


Asp 


Gin 


35 








40 








45 








Glu Pro Leu 


Tyr 


Pro 


Val 


Gin 


Val 


Ser 


Ser 


Ala Asp 


Ala 


Arg 


Leu 


Met 


50 






55 








60 










Val Phe Asp 


Lys 


Thr 


Glu 


Gly 


Thr 


Trp 


Arg 


Leu Leu 


Cys 


Ser 


Ser 


Arg 


65 




70 










75 








80 


Ser Asn Ala 


Arg 


Val 


Ala 


Gly 


Leu 


Ser 


Cys 


Glu Glu 


Met 


Gly 


Phe 


Leu 




85 










90 








95 




Arg Ala Leu 


Thr 


His 


Ser 


Glu 


Leu 


Asp 


Val 


Arg Thr 


Ala 


Gly 


Ala 


Asn 


100 










105 








110 




Thr 


Gly Thr Ser 


Gly 


Phe 


Phe 


Cys 


Val 


Asp 


Glu 


Gly Arg 


Leu 


Pro 


His 


115 








120 








125 








Gin Arg Leu 


Leu 


Glu 


Val 


He 


Ser 


Val 


Cys 


Asp Cys 


Pro 


Arg 


Gly 


Arg 


13 0 








135 








140 








Val 


Phe Ixeu Ala 


Ala 


He 


Cys 


Gin 


Asp 


Cys 


Gly 


Arg Arg 


Lys 


Leu 


Pro 


145 






150 










155 








160 


Asp Arg He 


Val 


Gly 


Gly 


Arg 


Asp 


Thr 


Ser 


Leu Gly 


Arg 


Trp 


Pro 


Trp 




165 










170 








175 




Gin Val Ser 


Leu 


Arg 


Tyr 


Asp 


Gly 


Ala 


His 


Leu Cys 


Gly 


Gly 


Ser 


Leu 




180 






165 








190 






Leu Ser Gly 


Asp 


Trp 


Val 


Leu 


Thr 


Ala 


Ala 


His Cys 


Phe 


Pro 


Glu 


Arg 


195 








200 








205 








Asn Arg Val 


Leu 


Ser 


Arg 


Trp 


Arg 


Val 


Phe 


Ala Gly 


Ala 


Val 


Ala 


Gin 


210 








215 








220 








His 


Ala Ser Pro 


His 


Gly 


Leu 


Gin 


Leu 


Gly 


Val 


Gin Ala 


Val 


Val 


Tyr 


225 




230 










235 








240 


Gly Gly Tyr 


Leu 


Pro 


Phe 


Arg 


Asp 


Pro 


Asn 


Ser Glu 


Glu 


Asn 


Ser 


Asn 




245 










250 








255 




Asp He Ala 


Leu 


Val 


His 


Leu 


Ser 


Ser 


Pro 


Leu Pro 


Leu 


Thr 


Glu 


Tyr 


260 










265 








270 






He Gin Pro 


Val 


Cys 


Leu 


Pro 


Ala 


Ala 


Gly 


Gin Ala 


Leu 


Val 


Asp 


Gly 


275 








2B0 








285 








Lys He Cys 


Thr 


Val 


Thr 


Gly 


Tip 


Gly 


Asn 


Thr Gin 


Tyr 


Tyr 


Gly 


Gin 


290 








295 








300 










Gin Ala Gly 


Val 


Leu 


Gin 


Glu 


Ala 


Arg 


Val 


Pro He 


He 


Ser 


Asn 


Asp 


305 






310 










315 








320 


Val Cys Asn 


Gly 


Ala 


Asp 


Phe 


Tyr 


Gly 


Asn 


Gin He 


Lys 


Pro 


Lys 


Met 
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325 ' 330 335 

Phe Cys Ala Gly Tyr Pro Glu Gly Gly lie Asp Ala Cys Gin Gly Asp 

340 345 350 

Ser Gly Gly Pro Phe Val Cys Glu Asp Ser lie Ser Arg Thr Pro Arg 

355 360 365 

Trp Arg Leu Cys Gly He Val Ser Trp Gly Thr Gly Cys Ala Leu Ala 

370 375 380 

Gin Lys Pro Gly Val Tyr Thr Lys Val Ser Asp Phe Arg Glu Trp He 
385 390 395 400 

Phe Gin Ala He Lys Thr His Ser Glu Ala Ser Gly Met Val Thr Gin 

405 410 415 

Leu 

<210> 36 

<211> 2479 

<212> DNA 

<213> Homo sapien 

<220> 
<221> CDS 

<222> (57) . . . (1535) 

<223> Nucleotide sequence encoding human serine protease (TMPRS2) 
<300> 

<308> GenBank U75329 
<309> 1997-10-10 

<400> 36 

gtcatattga acattccaga tacctatcat tactcgatgc tgttgataac agcaag atg 59 

Met 
1 



get ttg aac tea ggg tea cca cca get att gga ect tac tat gaa aac 
Ala Leu Asn Ser Gly Ser Pro Pro Ala He Gly Pro Tyr Tyr Glu Asn 
5 10 15 



cec act gtc tac gag gtg eat ceg get cag tac tac ccg tec cec gtg 
Pro Thr Val Tyr Glu Val His Pro Ala Gin Tyr Tyr Pro Ser Pro Val 
35 40 45 

cec cag tac gee ccg agg gtc ctg acg cag get tec aac cec gtc gtc 
Pro Gin Tyr Ala Pro Arg Val Leu Thr Gin Ala Ser Asn Pro Val Val 
50 55 60 65 

tgc acg cag cec aaa tec cca tec ggg aca gtg tgc ace tea aag act 
Cys Thr Gin Pro Lys Ser Pro Ser Gly Thr Val Cys Thr Ser Lys Thr 

70 75 80 



107 



cat gga tac caa ccg gaa aac cec tat cec gca cag cec act gtg gte 155 
His Gly Tyr Gin Pro Glu Asn Pro Tyr Pro Ala Gin Pro Thr Val Val 
20 25 30 



203 



251 



299 



aag aaa gca ctg tgc ate ace ttg ace ctg ggg ace ttc etc gtg gga 347 

Lys Lys Ala Leu Cys He Thr Leu Thr Leu Gly Thr Phe Leu Val Gly 

85 90 95 

get gcg ctg gcc get ggc eta etc tgg aag ttc atg ggc age aag tgc 3 95 

Ala Ala Leu Ala Ala Gly Leu Leu Trp Lys Phe Met Gly Ser Lys Cys 

100 105 110 

tec aac tct ggg ata gag tgc gac tec tea ggt ace tgc ate aac cec 443 
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Ser Asn Ser Gly lie Glu Cys Asp Ser Ser Gly Thr Cys lie Asn Pro 
115 120 125 

tct aac tgg tgt gat ggc gtg tea cac tgc ccc ggc ggg gag gac gag 491 
Ser Asn Trp Cys Asp Gly Val Ser His Cys Pro Gly Gly Glu Asp Glu 
130 135 140 145 

aat egg tgt gtt cgc etc tac gga cca aac ttc ate ctt cag atg tac 539 
Asn Arg Cys Val Arg Leu Tyr Gly Pro Asn Phe He Leu Gin Met Tyr 

150 155 160 

tea tct cag agg aag tec tgg cac cct gtg tgc caa gac gac tgg aac 587 
Ser Ser Gin Arg Lys Ser Trp His Pro Val Cys Gin Asp Asp Trp Asn 
165 170 175 

gag aac tac ggg egg geg gcc tgc agg gac atg ggc tat aag aat aat 635 
Glu Asn Tyr Gly Arg Ala Ala Cys Arg Asp Met Gly Tyr Lys Asn Asn 
180 185 190 



ttt tac tct age caa gga at a gtg gat gac age gga tee acc age ttt 
Phe Tyr Ser Ser Gin Gly He Val Asp Asp Ser Gly Ser Thr Ser Phe 
195 200 205 



tac cac agt gat gcc tgt tct tea aaa gca gtg gtt tct tta cgc tgt 

Tyr His Ser Asp Ala Cys Ser Ser Lys Ala Val Val Ser Leu Arg Cys 

230 235 240 

tta gcc tgc ggg gtc aac ttg aac tea age cgc cag age agg ate gtg 

Leu Ala Cys Gly Val Asn Leu Asn Ser Ser Arg Gin Ser Arg He Val 
245 250 253 



cac gtc cag aac gtc cac gtg tgc gga ggc tec ate ate ace ccc gag 

His Val Gin Asn Val His Val Cys Gly Gly Ser He He Thr Pro Glu 

275 280 285 

tgg ate gtg aca gcc gcc cac tgc gtg gaa aaa cct ctt aac aat cca 

Trp He Val Thr Ala Ala His Cys Val Glu Lys Pro Leu Asn Asn Pro 

290 295 300 305 

tgg cat tgg acg gca ttt geg ggg att ttg aga caa tct ttc atg ttc 

Trp His Trp Thr Ala Phe Ala Gly He Leu Arg Gin Ser. Phe Met Phe 

310 315 320 

tat gga gcc gga tac caa gta caa aaa gtg att tct eat cca aat tat 

Tyr Gly Ala Gly Tyr Gin Val Gin Lys Val He Ser His Pro Asn Tyr 

325 330 335 



683 



atg aaa ctg aac aca agt gcc ggc aat gtc gat ate tat aaa aaa ctg 731 
Met Lys Leu Asn Thr Ser Ala Gly Asn Val Asp He Tyr Lys Lys Leu 
210 215 220 225 



779 



827 



ggc ggt gag age geg etc ccg ggg gcc tgg ccc tgg cag gtc age ctg 875 
Gly Gly Glu Ser Ala Leu Pro Gly Ala Trp Pro Trp Gin Val Ser Leu 
260 265 270 



923 



971 



1019 



1067 



gac tec aag acc aag aac aat gac att geg ctg atg aag ctg cag aag 1115 

Asp Ser Lys Thr Lys Asn Asn Asp He Ala Leu Met Lys Leu Gin Lys 

340 345 350 

cct ctg act ttc aac gac eta gtg aaa cca gtg tgt ctg ccc aac cca 1163 

Pro Leu Thr Phe Asn Asp Leu Val Lys Pro Val Cys Leu Pro Asn Pro 
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355 360 365 

ggc atg atg ctg cag cca gaa cag etc tgc tgg att tec ggg tgg ggg 1211 

Gly Met Met Leu Gin Pro Glu Gin Leu Cys Trp lie Ser Gly Trp Gly 

370 375 380 385 

gcc acc gag gag aaa ggg aag acc tea gaa gtg ctg aac get gcc aag 1259 

Ala Thr Glu Glu Lys Gly Lys Thr Ser Glu Val Leu Asn Ala Ala Lys 

390 395 400 



gtg ctt etc att gag aca cag aga tgc aac age aga tat gtc tat gac 
Val Leu Leu lie Glu Thr Gin Arg Cys Asn Ser Arg Tyr Val Tyr Asp 
405 410 415 



1307 



aac ctg ate aca cca gcc atg ate tgt gcc ggc ttc ctg cag ggg aac 1355 
Asn Leu lie Thr Pro Ala Met He Cys Ala Gly Phe Leu Gin Gly Asn 
420 425 430 

gtc gat tct tgc cag ggt gac agt gga ggg cct ctg gtc act teg aac 1403 
Val Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Thr Ser Asn 
435 440 445 

aac aat ate tgg tgg ctg ata ggg gat aca age tgg ggt tct ggc tgt 1451 
Asn Asn He Trp Trp Leu He Gly Asp Thr Ser Trp Gly Ser Gly Cys 
450 455 460 465 

gcc aaa get tac aga cca gga gtg tac ggg aat gtg atg gta ttc acg 1499 
Ala Lys Ala Tyr Arg Pro Gly Val Tyr Gly Asn Val Met Val Phe Thr 

470 475 480 

gac tgg att tat cga caa atg aag gca aac ggc taa tccacatggt 1545 
Asp Trp He Tyr Arg Gin Met Lys Ala Asn Gly * 
485 490 

cttcgtcctt gacgtcgttt tacaagaaaa eaatggggct ggttttgctt ccccgtgcat 1605 
gatttactct tagagatgat teagaggtea cttcattttt attaaaeagt gaacttgtct 1665 
ggctttggca ctctctgcca tactgtgcag gctgcagtgg ctcccetgcc cagcctgctc 1725 
tccctaaccc cttgtccgca aggggtgatg gccggctggt tgtgggcact ggcggtcaat 1785 
tgtggaagga agagggttgg aggctgeccc cattgagatc ttcctgctga gtcctttcca 1845 
ggggccaatt ttggatgagc atggagctgt cacttctcag ctgctggatg acttgagatg 
aaaaaggaga gacatggaaa gggagacagc caggtggcac ctgcagcggc tgccctctgg 
ggccacttgg tagtgtcccc agcctactte acaaggggat tttgetgatg ggttcttaga 
gccttagcag ccctggatgg tggccagaaa taaagggacc agcccttcat gggtggtgac 2085 
gtggtagtca cttgtaaggg gaacagaaac atttttgttc ttatggggtg agaatataga ^ta^ 
cagtgccctt ggtgcgaggg aagcaattga aaaggaactt gccctgagca etcctggtgc 
aggtctecac ctgcacattg ggtggggetc ctgggaggga gactcagect tcctcctcat 
cctccctgac ectgctccta gcaccctgga gagtgaatgc cccttggtcc etggcagggc 2325 
gccaagtttg gcaccatgtc ggcctcttca ggcctgatag tcattggaaa ttgaggtcca 
tgggggaaat caaggatgct cagtttaagg tacactgttt ccatgttatg tttctacaca 
ttgatggtgg tgaccctgag ttcaaagcca tctt 

<210> 37 
<211> 492 
<212> PRT 
<213> Homo sapien 

Met°Ali^Leu Asn Ser Gly Ser Pro Pro Ala He Gly Pro Tyr Tyr Glu 

15 10 15 

Asn His Gly Tyr Gin Pro Glu Asn Pro Tyr Pro Ala Gin Pro Thr Val 
20 25 30 



1905 
1965 
2025 
2085 
2145 
2205 
2265 
2325 
2385 
2445 
2479 
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Val Pro Thr Val 


Tyr Glu Val His 


Pro 


Ala 


Gin 


Tyr 


Tyr 


Pro 


Ser 


Pro 


35 


40 










45 








Val Pro Gin Tyx 


Ala Pro Arg Val 


Leu 


Thr 


Gin 


Ala 


Ser 


Asn 


Pro 


Val 


50 


55 








60 










Val Cys Thr Gin 


Pro Lys Ser Pro 


Ser Gly 


Thr 


Val 


Cys 


Thr 


Ser 


Lys 


65 


70 






/b 










80 


Thr Lys Lys Ala 


Leu Cys lie Thr 


Leu 


Thr 


Leu 


Gly 


Thr 


Phe 


Leu 


Val 


85 




90 










95 




Gly Ala Ala Leu Ala Ala Gly Leu 


Leu 


Trp 


Lys 


Phe 


Met 


Gly 


Ser 


Lys 


100 




105 










110 






Cys Ser Asn Ser 


Gly lie Glu Cys 


Asp 


Ser 


Ser 


Gly Thr Cys 


He 


Asn 


115 


120 










125 








Pro Ser Asn Trp Cys Asp Gly Val 


Ser 


His 


Cys 


Pro 


Gly Gly 


Glu Asp 


130 


135 








140 










Glu Asn Arg Cys 


Val Arg Leu Tyr 


Gly 


Pro 


Asn 


Phe 


He 


Leu 


Gin 


Met 


145 


150 






155 










160 


Tyr Ser Ser Gin Arg Lys Ser Trp 


His 


Pro 


Val 


Cys 


Gin Asp 


Asp 


Trp 




165 




170 










175 




Asn Glu Asn Tyr Gly Arg Ala Ala 


Cys 


Arg 


Asp 


Met 


Gly Tyr 


Lys 


Asn 


180 




185 










190 






Asn Phe Tyr Ser 


Ser Gin Gly He 


Val 


Asp 


Asp 


Ser 


Gly 


Ser 


Thr 


Ser 


195 


200 










205 








Phe Met Lys Leu 


Asn Thr Ser Ala 


Gly Asn 


Val 


Asp 


He 


Tyr 


Lys 


Lys 


210 


215 








220 










Leu Tyr His Ser 


Asp Ala Cys Ser 


Ser 


Lys 


Ala 


Val 


Val 


Ser 


Leu 


Arg 


225 


230 






235 










240 


Cys Leu Ala Cys 


Gly Val Asn Leu 


Asn 


Ser 


Ser Arg 


Gin 


Ser 


Arg 


He 


245 




250 










255 




Val Gly Gly Glu 


Ser Ala Leu Pro 


Gly Ala 


Trp 


Pro 


Trp 


Gin 


Val 


Ser 


260 




265 










270 






Leu His Val Gin 


Asn Val His Val 


Cys 


Gly 


Gly 


Ser 


He 


He 


Thr 


Pro 


275 


280 










285 








Glu Tip lie Val 


Thr Ala Ala His 


Cys 


Val 


Glu 


Lys 


Pro 


Leu 


Asn 


Asn 


290 


295 








300 










Pro Trp His Trp 


Thr Ala Phe Ala 


Gly 


He 


Leu 


Arg 


Gin 


Ser 


Phe 


Met 


305 


310 






315 






His 




320 


Phe Tyr Gly Ala Gly Tyr Gin Val 


Gin 


Lys 


Val 


He 


Ser 


Pro 


Asn 




325 




330 










335 




Tyr Asp Ser Lys 


Thr Lys Asn Asn 


Asp 


He 


Ala 


Leu 


Met 


Lys 


Leu 


Gin 


340 




345 










350 






Lys Pro Leu Thr 


Phe Asn Asp Leu 


Val 


Lys 


Pro 


Val 


Cys 


Leu 


Pro 


Asn 


355 


360 










365 








Pro Gly Met Met 


Leu Gin Pro Glu 


Gin 


Leu 


Cys 


Trp 


He 


Ser 


Gly Trp 


370 


375 








380 








Ala 


Gly Ala Thr Glu 


Glu Lys Gly Lys 


Thr 


Ser 


Glu 


Val 


Leu 


Asn 


Ala 


385 


390 






395 








Val 


400 


Lys Val Leu Leu 


He Glu Thr Gin 


Arg 


Cys 


Asn 


Ser Arg 


Tyr 


Tyr 


405 




410 










415 




Asp Asn Leu lie 


Thr Pro Ala Met 


He 


Cys 


Ala 


Gly 


Phe 


Leu 


Gin 


Gly 


420 




425 










430 






Asn Val Asp Ser 


Cys Gin Gly Asp 


Ser 


Gly 


Gly 


Pro 


Leu 


Val 


Thr 


Ser 


435 


440 










445 








Asn Asn Asn lie 


Trp Trp Leu He 


Gly Asp 


Thr 


Ser 


Trp 


Gly 


Ser 


Gly 


450 


455 








460 










Cys Ala Lys Ala 


Tyr Arg Pro Gly 


Val 


Tyr 


Gly Asn Val 


Met 


Val 


Phe 


465 


470 






475 










480 


Thr Asp Trp lie 


Tyr Arg Gin Met 


Lys 


Ala 


Asn 


Gly 










485 




490 















<210> 38 
<2ll> 2079 
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<212> DNA 

<213> Homo sapien 

<220> 
<221> CDS 

<222> (251) . . . (1522) 

<223> Nucleotide sequence encoding transmenibrane 
■ protease, serine 4 {TMPRSS4) 

<300> 

<30e> GenBank NM016425 
<309> 2000-11-06 

<400> 38 

gagaggcagc agcttgttca gcggacaagg atgctgggcg tgagggacca aggcctgccc 60 
tgcactcggg cctcctccag ccagtgctga ccagggactt ctgacctgct ggccagccag 120 
gacctgtgtg gggaggccct cctgctgcct tggggtgaca atctcagctc caggctacag 180 
ggagaccggg aggatcacag agccagcatg gtacaggatc ctgacagtga tcaacctctg 240 
aacagcctcg atg tea aac ccc tgc gca aac ccc gta tec oca tgg aga 289 
Met Ser Asn Pro Cys Ala Asn Pro Val Ser Pro Trp Arg 
15 10 



30 



35 40 45 



cct tea gaa agt gtg ggg ate ccc ate ate at a gca eta ctg age ctg 337 
Pro Ser Glu Ser Val Gly lie Pro He He He Ala Leu Leu Ser Leu 
15 20 25 

gcg agt ate ate att gtg gtt gtc etc ate aag gtg att ctg gat aaa 
Ala Ser lie lie He Val Val Val Leu He Lys Val He Leu Asp Lys 



385 



433 



481 



529 



tac tac ttc etc tge ggg cag cct etc cac ttc ate ccg agg aag cag 
Tyr Tyr Phe Leu Cys Gly Gin Pro Leu His Phe He Pro Arg Lys Gin 

50 55 60 

ctg tgt gac gga gag ctg gac tgt ccc ttg ggg gag gac gag gag cac 
Leu Cys Asp Gly Glu Leu Asp Cys Pro Leu Gly Glu Asp Glu Glu Hifi 
65 70 75 

tat gtc aag age ttc ccc gaa ggg cet gea gtg gca gtc egc etc tec 
Cvs Val Lys Ser Phe Pro Glu Gly Pro Ala Val Ala Val Arg Leu Ser 
^ 80 85 90 

aag gac ega tec aca ctg cag gtg ctg gac teg gee aea ggg aac tgg 577 
Lys Asp Arg Ser Thr Leu Gin Val Leu Asp Ser Ala Thr Gly Asn Trp 
95 100 105 

ttc tct gee tgt ttc gac aac ttc aca gaa get etc get gag aea gee 
Phe Ser Ala Cys Phe Asp Asn Phe Thr Glu Ala Leu Ala Glu Thr Ala 
110 115 120 125 

tgt agg cag atg ggc tac age age aaa ccc act ttc aga get gtg gag 
Cvs Arg Gin Met Gly Tyr Ser Ser Lys Pro Thr Phe Arg Ala Val Glu 

130 135 140 

att ggc cca gac cag gat ctg gat gtt gtt gaa ate aea gaa aac age 
He Gly Pro Asp Gin Asp Leu Asp Val Val Glu He Thr Glu Asn Ser 
145 150 155 

cag gag ctt egc atg egg aac tea agt ggg ccc tgt etc tea ggc tee 
Gin Glu Leu Arg Met Arg Asn Ser Ser Gly Pro Cys Leu Ser Gly Ser 
160 165 170 



625 



673 



721 



769 
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ctg gtc tec ctg cac tgt ctt gcc tgt ggg aag age ctg aag acc ccc 817 
Leu Val Ser Leu His Cys Leu Ala Cys Gly Lys Ser Leu Lys Thr Pro 
175 180 185 

cgt gtg gtg ggt ggg gag gag gcc tct gtg gat tct tgg cct tgg cag 865 
Arg Val Val Gly Gly Glu Glu Ala Ser Val Asp Ser Trp Pro Trp Gin 
190 195 200 205 

gtc age ate cag tac gac aaa cag cac gtc tgt gga ggg age ate ctg 913 
Val Ser lie Gin Tyr Asp Lys Gin His Val Cys Gly Gly Ser lie Leu 

210 215 220 

gac ccc cac tgg gtc etc acg gea gcc cac tgc ttc agg aaa cat acc 961 
Asp Pro His Trp Val Leu Thr Ala Ala His Cys Phe Arg Lys His Thr 
225 230 235 

gat gtg ttc aac tgg aag gtg egg gea ggc tea gac aaa ctg ggc age 
Asp Val Phe Asn Trp Lys Val Arg Ala Gly Ser Asp Lys Leu Gly Ser 
240 245 250 

ttc cca tec ctg get gtg gcc aag ate ate ate att gaa ttc aac ccc 
Phe Pro Ser Leu Ala Val Ala Lys He He He He Glu Phe Asn Pro 
255 260 265 

atg tac ccc aaa gac aat gac ate gcc etc atg aag ctg cag ttc cca 
Met Tvr Pro Lys Asp Asn Asp He Ala Leu Met Lys Leu Gin Phe Pro 
270 275 280 285 

etc act ttc tea ggc aca gtc agg ccc ate tgt ctg ccc ttc ttt gat 
Leu Thr Phe Ser Gly Thr Val Arg Pro He Cys Leu Pro Phe Phe Asp 

290 295 300 

qag gag etc act cca gcc acc cca etc tgg ate att gga tgg ggc ttt 1201 
GlS GlS Leu Thr Pro Ala Thr Pro Leu Trp He He Gly Trp Gly Phe 
305 310 315 

acg aag cag aat gga ggg aag atg tct gac ata ctg ctg cag gcg tea 1249 
Thr Lys Gin Asn Gly Gly Lys Met Ser Asp He Leu Leu Gin Ala Ser 
320 325 330 

gtc cag gtc att gac age aca egg tgc aat gea gac gat gcg tac cag 1297 
Val Gin Val He Asp Ser Thr Arg Cys Asn Ala Asp Asp Ala Tyr Gin 
335 340 345 

qag gaa gtc ace gag aag atg atg tgt gea ggc ate ccg gaa ggg ggt 1345 
Gly Glu Val Thr Glu Lys Met Met Cys Ala Gly He Pro Glu Gly Gly 
350 355 360 365 

gtg gac acc tgc cag ggt gac agt ggt ggg ccc ctg atg tac caa tct 1393 
val Asp Thr Cys Gin Gly Asp Ser Gly Gly Pro Leu Met Tyr Gin Ser 
• 370 375 380 

gac cag tgg cat gtg gtg ggc ate gtt age tgg ggc tat ggc tgc ggg 1441 
Asp Gin Trp His Val Val Gly He Val Ser Trp Gly Tyr Gly Cys Gly 
385 390 395 

ggc ccg age acc cca gga gta tac ace aag gtc tea gcc tat etc aac 1489 
Gly Pro Ser Thr Pro Gly Val Tyr Thr Lys Val Ser Ala Tyr Leu Asn 
400 405 410 



1009 



1057 



1105 



1153 
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tgg ate tac aat gtc tgg aag get gag ctg taa tgctgctgcc cctttgcagt 1542 
Trp lie Tyr Asn Val Trp Lys Ala Glu Leu * 
415 420 

gctgggagcc gcttccttcc tgccctgccc acctggggat cccccaaagt cagacacaga 1602 

gcaagagtcc ccttgggtac acccctctgc ccacagcctc agcatttctt ggagcagcaa 1662 

agggcctcaa ttcctgtaag agaccctcgc agcccagagg cgcccagagg aagtcagcag 1722 

ccctagctcg gccacacttg gtgctcccag catcccaggg agagacacag cccactgaac 1782 

aaggtctcag gggtattgct aagccaagaa ggaactttcc cacactactg aatggaagca 1842 

ggctgtcttg taaaagccca gatcactgtg ggctggagag gagaaggaaa gggtctgcgc 1902 

cagccctgtc cgtcttcacc catccccaag cctactagag caagaaacca gttgtaatat 1962 

aaaatgcact gccctactgt tggtatgact accgttacct actgttgtca ttgttattac 2022 

agctatggcc act at tat t a aagagctgtg taacatcaaa aaaaaaaaaa aaaaaaa 2079 

<210> 39 

<211> 423 

<212> PRT 

<213> Homo sapien 

<400> 39 « „ ^1 

Met Ser Asn Pro Cys Ala Asn Pro Val Ser Pro Trp Arg Pro Ser Glu 

15 10 15 

ser Val Gly lie Pro He He He Ala Leu Leu Ser Leu Ala Ser He 

20 25 30 

He He Val Val Val Leu He Lys Val He Leu Asp Lys Tyr Tyr Phe 

35 40 45 

Leu Cys Gly Gin Pro Leu His Phe He Pro Arg Lys Gin Leu Cys Asp 

50 55 60 

Glv Glu Leu Asp Cys Pro Leu Gly Glu Asp Glu Glu His Cys Val Lys 
65 70 75 80 

Ser Phe Pro Glu Gly Pro Ala Val Ala Val Arg Leu Ser Lys Asp Arg 

85 90 95 

Ser Thr Leu Gin Val Leu Asp Ser Ala Thr Gly Asn Trp Phe Ser Ala 

100 105 110 

Cys Phe Asp Asn Phe Thr Glu Ala Leu Ala Glu Thr Ala Cys Arg Gin 

lis 120 125 

Met Gly Tyr Ser Ser Lys Pro Thr Phe Arg Ala Val Glu He Gly Pro 

130 135 140 

ASP Gin Asp Leu Asp Val Val Glu He Thr Glu Asn Ser Gin Glu Leu 
145 150 155 160 

Arg Met Arg Asn Ser Ser Gly Pro Cys Leu Ser Gly Ser Leu Val Ser 

165 170 175 

Leu His Cys Leu Ala Cys Gly Lys Ser Leu Lys Thr Pro Arg Val Val 

180 185 190 

Gly Gly Glu Glu Ala Ser Val Asp Ser Trp Pro Trp Gin Val Ser He 

195 200 205 

Gin Tyr Asp Lys Gin His Val Cys Gly Gly Ser He Leu Asp Pro His 

210 215 220 

Trp Val Leu Thr Ala Ala His Cys Phe Arg Lys His Thr Asp Val Phe 
225 230 235 240 

Asn Trp Lys Val Arg TVla Gly Ser Asp Lys Leu Gly Ser Phe Pro Ser 

245 250 255 

Leu Ala Val Ala Lys He He He He Glu Phe Asn Pro Met Tyr Pro 

260 265 270 

Lys Asp Asn Asp He Ala Leu Met Lys Leu Gin Phe Pro Leu Thr Phe 

275 280 285 

Ser Gly Thr Val Arg Pro He Cys Leu Pro Phe Phe Asp Glu Glu Leu 

290 295 300 

Thr Pro Ala Thr Pro Leu Trp He He Gly Trp Gly Phe Thr Lys Gin 
305 310 315 320 

Asn Gly Gly Lys Met Ser Asp He Leu Leu Gin Ala Ser Val Gin Val 
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325 








lie 


Asp 


Ser 


Thr Arg 


Cys 


Asn 


Ala 








340 










Thr 


Glu 


Lys 
355 


Met 


Met 


Cys 


Ala 


Gly 
360 


Cys 


Gin 


Gly 


Asp 


Ser 


Gly 


Gly 


Pro 


370 










375 




His 


Val 


Val 


Gly 


lie 


Val 


Ser 


Trp 


385 










390 






Thr 


Pro 


Gly 


Val 


Tyr 
405 


Thr 


Lys 


Val 


Asn 


Val 


Trp 


Lys 
420 


Ala 


Glu 


Leu 





330 335 



Asp 


Asp Ala Tyr 


Gin Gly Glu 


Val 


345 




350 






He 


Pro Glu Gly Gly Val 


Asp 


Thr 






365 






Leu 


Met Tyr Gin 


Ser Asp 


Gin 


Trp 




380 








Gly 


Tyr Gly Cys 


Gly Gly 


Pro 


Ser 


395 






400 


Ser 


Ala Tyr Leu 


Asn Trp 


He 


Tyr 




410 




415 





<210> 40 
<211> 1471 
<212> DNA 

<213> Artifcial sequence 



<220> 

<223> DESCl gene 

<221> misc_feature 
<222> (626) . . . {1324) 
<223> protease domain 

<221> CDS 

<222> (56) . . . (1324) 



<400> 40 ^ 4_ 4- CO 

tgacttggat gtagacctcg accttcacag gactcttcat tgctggttgg caatg atg 5B 

1 

tat egg cca gat gtg gtg agg get agg aaa aga gtt tgt tgg gaa ccc 106 
TVr Arc Pro Asp Val Val Arg Ala Arg Lys Arg Val Cys Trp Glu Pro 
'5 10 15 

tgg gtt ate ggc etc gtc ats ttc ata tee etg att gtc ctg gca gtg 154 
Tro Val He Gly Leu Val Xaa Phe He Ser Leu He Val Leu Ala Val 
20 25 30 

tgc att gga stc act gtt cat tat gtg aga tat aat caa aag aag acc 202 
Cvs He Gly Xaa Thr Val His Tyr Val Arg Tyr Asn Gin Lys Lys Thr 
35 40 45 

tac aat tac tat age aca ttg tea ttt aca act gac aaa eta tat get 250 
Tyr Asn Tyr Tyr Ser Thr Leu Ser Phe Thr Thr Asp Lys Leu Tyr Ala 
50 55 60 65 

gag ttt ggc aga gag get tct aac aat ttt aca gaa atg age cag aga 298 
Glu Phe Gly Arg Glu Ala Ser Asn Asn Phe Thr Glu Met Ser Gin Arg 

70 75 80 



346 



Gtt gaa tea atg gtg aaa aat gca ttt tat aaa tct cca tta agg gaa 
Leu Glu Ser Met Val Lys Asn Ala Phe Tyr Lys Ser Pro Leu Arg Glu 
85 90 95 

gaa ttt gtc aag tct cag gtt ate aag ttc agt caa cag aag cat gga 394 
Glu Phe Val Lys Ser Gin Val He Lys Phe Ser Gin Gin Lys His Gly 
100 105 110 
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gtg ttg get cat atg ctg ttg att tgt aga ttt cac tct act gag gat 442 
Val Leu Ala His Met Leu Leu lie Cys Arg Phe His Ser Thr Glu Asp 
115 120 125 

cct gaa act gta gat aaa att gtt caa ctt gtt tta cat gaa aag ctg 490 
Pro Glu Thr Val Asp Lys lie Val Gin Leu Val Leu His Glu Lys Leu 
130 135 140 145 

caa gat get gta gga ccc cct aaa gta gat cct cac tea gtt aaa att 53 8 

Gin Asp Ala Val Gly Pro Pro Lys Val Asp Pro His Ser Val Lys lie 

150 155 160 

aaa aaa ate aac aag aca gaa aca gac age tat eta aac cat tgc tgc 586 
Lys Lys lie Asn Lys Thr Glu Thr Asp Ser Tyr Leu Asn His Cys Cys 
165 170 175 

gga aca ega aga agt aaa act eta ggt cag agt etc agg ate gtt ggt 634 
Gly Thr Arg TVrg Ser Lys Thr Leu Gly Gin Ser Leu Arg lie Val Gly 
180 185 190 

ggg aca gaa gta gaa gag ggt gaa tgg ccc tgg cag get age ctg cag 682 
Gly Thr Glu Val Glu Glu Gly Glu Trp Pro Trp Gin Ala Ser Leu Gin 
195 200 205 

tgg gat ggg agt eat cgc tgt gga gca ace tta att aat gcc aca tgg 73 0 

Trp Asp Gly Ser His Arg Cys Gly Ala Thr Leu lie Asn Ala Thr Trp 
210 215 220 225 

ctt gtg agt get get cac tgt ttt aca aca tat aag aac cct gcc aga 778 
Leu Val Ser Ala Ala His Cys Phe Thr Thr Tyr Lys Asn Pro Ala Arg 

230 235 240 



tgg act get tee ttt gga gta aca ata aaa cct teg aaa atg aaa egg 

Trp Thr Ala Ser Phe Gly Val Thr lie Lys Pro Ser Lys Met Lys Arg 

245 250 255 

ggt etc egg aga ata att gtc cat gaa aaa tac aaa cac eca tea eat 

Gly Leu Arg Arg He He Val His Glu Lys Tyr Lys His Pro Ser His 
260 265 270 

gac tat gat att tct ctt gca gag ctt tct age cct gtt ccc tac aca 

Asp Tyr Asp He Ser Leu Ala Glu Leu Ser Ser Pro Val Pro Tyr Thr 

275 280 285 

aat gca gta cat aga gtt tgt etc cct gat gca tec tat gag ttt caa 

Asn Ala Val His Arg Val Cys Leu Pro Asp Ala Ser Tyr Glu Phe Gin 

290 295 300 305 

cca ggt gat gtg atg ttt gtg aca gga ttt gga gca ctg aaa aat gat 

Pro Gly Asp Val Met Phe Val Thr Gly Phe Gly Ala Leu Lys Asn Asp 

310 315 320 

ggt tac agt caa aat cat ctt ega caa gca cag gtg act etc ata gac 

Gly Tyr Ser Gin Asn His Leu Arg Gin Ala Gin Val Thr Leu He Asp 

325 330 335 

get aca act tgc aat gaa cct caa get tac aat gac gcc ata act cct 

Ala Thr Thr Cys Asn Glu Pro Gin Ala Tyr Asn Asp Ala He Thr Pro 
340 345 350 



826 



874 



922 



970 



1018 



1066 



1114 
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aga atg tta tgt get ggc tec tta gaa gga aaa aca gat gca tgc cag 1162 
Arg Met Leu Cys Ala Gly Ser Leu Glu Gly Lys Thr Asp Ala Cys Gin 
355 360 365 

ggt gac tct gga gga cca ctg gtt agt tea gat get aga gat ate tgg 1210 
Gly Asp Ser Gly Gly Pro Leu Val Ser Ser Asp Ala Arg Asp He Trp 
370 375 380 385 

tac ctt get gga at a gtg age tsg gga gat gaa tgt gcg aaa ccc aae 1258 
Tyr Leu Ala Gly He Val Ser Xaa Gly Asp Glu Cys Ala Lys Pro Asn 

390 395 400 

aag cct ggt gtt tat act aga gtt acg gee ttg egg gac tgg att act 1306 
Lys Pro Gly Val Tyr Thr Arg Val Thr Ala Leu Arg Asp Trp He Thr 
405 410 415 

tea aaa aet ggt ate taa gagagaaaag ccteatggaa cagataacat 1354 
Ser Lys Thr Gly He * 
420 

ttttttttgt tttttgggtg tggaggccat ttttagagat acagaattgg agaagacttg 1414 
caaaacaget agatttgact gatctcaata aae t gtt tgc ttgatgeaaa aaaaaaa 1471 

<210> 41 

<211> 422 

<212> PRT 

<213> Homo Saplen 

<220> 

<221> VARIANT 

<222> 24 

<223> Xaa is He or Met 



<221> VARIANT 
<222> 37 

<223> Xaa is Leu or Val 

<221> VARIANT 
<222> 393 

<223> Xaa is Ser or Trp 



<400> 41 



Met 


Tyr 


Arg 


Pro 


Asp 


Val 


Val 


Arg 


Ala 


Arg 


Lys 


Arg 


Val 


Cys 


Trp 


Glu 


1 




5 










10 










15 




Pro 


Trp 


Val 


He 


Gly 


Leu 


Val 


Xaa 


Phe 


He 


Ser 


Leu 


He 


Val 


Leu 


Ala 






20 










25 










30 






Val 


Cys 


He 


Gly 


Xaa 


Thr 


val 


His 


Tyr 


Val 


Arg 


Tyr 


Asn 


Gin 


Lys 


Lys 




35 








40 










45 








Thr 


Tyr 


Asn 


Tyr 


Tyx 


Ser 


Thr 


Leu 


Ser 


Phe 


Thr 


Thr 


Asp 


Lys 


Leu 


Tyx 




50 








55 










60 










Ala 


Glu 


Phe 


Gly 


Arg 


Glu 


Ala 


Ser 


Asn 


Asn 


Phe 


Thr 


Glu 


Met 


Ser 


Gin 


65 






70 










75 










80 


Arg 


Leu 


Glu 


Ser 


Met 


Val 


Lys 


Asn 


Ala 


Phe 


Tyr 


Lys 


Ser 


Pro 


Leu 


Arg 








85 










90 










95 




Glu 


Glu 


Phe 


Val 
100 


Lye 


Ser 


Gin 


Val 


He 
105 


Lys 


Phe 


Ser 


Gin 


Gin 
110 


Lys 


His 


Gly 


Val 


Leu 


Ala 


His 


Met 


Leu 


Leu 


He 


Cys 


Arg 


Phe 


His 


Ser 


Thr 


Glu 




115 










120 










125 








Asp 


Pro 


Glu 


Thr 


Val 


Asp 


Lys 


He 


Val 


Gin 


Leu 


Val 


Leu 


His 


Glu 


Lys 


130 










135 










140 


His 








Leu 


Gin 


Asp 


Ala 


Val 


Gly 


Pro 


Pro 


Lys 


Val 


Asp 


Pro 


Ser 


Val 


Lys 
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145 








150 








155 








1 ct\ 


He 


T ■ 

Lys 


Lys 


He Asn 


Lys Thr 


Glu 


Thr 


Asp 


Ser 


Tyr 


Leu Asn 


nxs 


Cys 






165 








170 








J. / 3 




Cys 


Gly 


Tiir 


Arg Arg 


Ser Lys 


Thr Leu Gly Gin 




jjcu Arg 




vax 








180 






185 








1 on 








Gly 


Thr 


Glu Val 


Glu Glu 


Gly Glu 


Trp 


Pro 




V3j.n AJ.a 




_ 

Leu 










200 










^ vr ^ 








Trp 


Asp 


Gly Ser His Arg 


Cys 


Gly Ala Thr 




± JLc Asn 




i.xir 




210 






215 


















Trp 


Leu 


vax 


Ser Ala 


Ala His 


Cys 


Phe 


Thr 


Thr 


lyr 


Lys Asn 


Pro 


Axa 


T rr 








230 








235 








A U 


Arg 


Trp 


Thr 


Ala Ser 


Phe Gly 


Val 


Thr 


He 


Lys 


Pro 


Ser Lys 


Met 


Lys 




245 








250 








255 




Aarg 


Gly 


Leu 


Arg Arg 


He lie 


Val 


His 


Glu 


Lys 


Tyr 


Lys His 


Pro 


oer 




260 






265 








270 






His 


Asp 


Ty^^ 


Asp He 


Ser Leu 


Ala 


Glu 


Leu 


Ser 


Ser 


Pro Val 


Pro 


lyr 








260 










285 






Thr 


Asn 


Ala 


Val His Arg Val 


Cys 


Leu 


Pro 


Asp 


Ala 


Ser Tyr 


Glu 


Phe 




290 






295 










300 








Gin 


Pro 


Gly 


Asp Val 


Met Phe 


Val 


Thr 


Gly 


Phe 


Gly 


Ala Leu 


Lys 


Asn 


305 




310 








315 








320 


Asp 


Gly 


Tyr 


Ser Gin 


Asn His 


Leu 


Arg 


Gin 


Ala 


Gin 


Val Thr 


Leu 


He 


325 








330 








335 




Asp 


Ala 


Thr 


Thr Cys 


Asn Glu 


Pro 


Gin 


Ala 


Tyr 


Asn 


Asp Ala 


He 


Thr 






340 






345 








350 






Pro 


Arg 


Met 


Leu Cys 


Ala Gly 


Ser Leu Glu Gly 


Lys 


Thr Asp Ala 


Cys 




355 






360 










365 






Gin 


Gly 


Asp 


Ser Gly Gly Pro 


Leu 


Val 


Ser 


Ser Asp 


Ala Arg Asp 


He 




370 




375 










380 








Trp 


Tyr 


Leu 


Ala Gly 


He Val 


Ser 


Xaa 


Gly Asp Glu 


Cys Ala 


Lys 


Pro 


385 






390 








395 








400 


Asn 


Lys 


Pro 


Gly Val 


Tyr Thr 


Arg 


Val 


Thr 


Ala 


Leu 


Arg Asp 


Trp 


He 






405 








410 








415 




Thr 


Ser 


Lye 


Thr Gly 


He 



















420 



<210> 42 

<211> 1257 

<212> DMA 

<213> Homo sapien 

<220> 

<221> CDS 

<222> (1) . . . (1257) 

<223> Nucleotide sequence encoding MTSP9 



<400> 42 

atg atg tat egg aca gta gga ttt ggc acc cga age aga aat ctg aag 48 
Met Met Tyr Arg Thr Val Gly Phe Gly Thr Arg Ser Arg Asn Leu Lys 
IS 10 15 

cca tgg atg att gcc gtt etc att gtg ttg tec etg aca gtg gtg gca 96 
Pro Trp Met He Ala Val Leu He Val Leu Ser Leu Thr Val Val Ala 
20 25 30 



gtg aee ata ggt etc ctg gtt cac ttc eta gta ttt gae eaa aaa aag 
Val Thr He Gly Leu Leu Val His Phe Leu Val Phe Asp Gin Lys Lys 
35 40 45 



144 



gag tae tat cat gge tec ttt aaa att tta gat cea eaa ate aat aac 
Glu Tyr Tyr His Gly Ser Phe Lys He Leu Asp Pro Gin He Asn Asn 



192 



wo 02/095007 



PCT/US02/16819 



-84- 



50 55 60 

aat ttc gga caa age aac aca tat caa ctt aag gac tta cga gag acg 24 0 

Asn Phe Gly Gin Ser Asn Thr Tyr Gin Leu Lys Asp Leu Arg Glu Thr 

65 70 75 80 

acc gaa aat ttg gtg gat gag at a ttt ata gat tea gcc tgg aag aaa 288 

Thr Glu Asn Leu Val Asp Glu lie Phe He Asp Ser Ala Trp Lys Lys 

85 90 95 

aat tat ate aag aac eaa gta gtc aga ctg act cca gag gaa gat ggt 336 

Asn Tyr He Lys Asn Gin Val Val Arg Leu Thr Pro Glu Glu Asp Gly 

100 105 110 

gtg aaa gta gat gtc att atg gtg ttc cag ttc cce tct act gaa caa 384 

Val Lys Val Asp Val He Met Val Phe Gin Phe Pro Ser Thr Glu Gin 

115 120 125 

agg gca gta aga gag aag aaa ate caa age ate tta aat cag aag ata 432 

Arg Ala Val Arg Glu Lys Lys He Gin Ser He Leu Asn Gin Lys He 

130 135 140 



agg aat tta aga gcc ttg cca ata aat gcc tea tea gtt caa gtt aat 
Arg Asn Leu Arg Ala Leu Pro He Asn Ala Ser Ser Val Gin Val Asn 
145 150 155 160 

gca atg age tea tea aca ggg gag tta act gtc caa gca agt tgt ggt 
Ala Met Ser Ser Ser Thr Gly Glu Leu Thr Val Gin Ala Ser Cys Gly 

165 170 175 

aaa cga gtt gtt cca tta aac gtc aac aga ata gca tct gga gtc att 
Lys Arg Val Val Pro Leu Asn Val Asn Arg He Ala Ser Gly Val He 
180 185 190 



gca gca cac tgc ttc cag aag tat aaa aat cca cat caa tgg act gtt 

Ala Ala His Cys Phe Gin Lys Tyr Lys Asn Pro His Gin Trp Thr Val 

225 230 235 240 

agt ttt gga aca aaa ate aac cct cce tta atg aaa aga aat gtc aga 

Ser Phe Gly Thr Lys He Asn Pro Pro Leu Met Lys Arg Asn Val Arg 

245 250 255 

aga ttt att ate cat gag aag tac cgc tct gca gca aga gag tac gac 

Arg Phe He He His Glu Lys Tyr Arg Ser Ala Ala Arg Glu Tyr Asp 

260 265 270 

att get gtt gtg cag gtc tct tec aga gtc acc ttt teg gat gac ata 

He Ala Val Val Gin Val Ser Ser Arg Val Thr Phe Ser Asp Asp He 

275 280 285 

cgc egg att tgt ttg cca gaa gcc tct gca tec ttc caa cca aat ttg 

Arg Arg He Cys Leu Pro Glu Ala Ser Ala Ser Phe Gin Pro Asn Leu 

290 295 300 



480 



528 



576 



gca cce aag gcg gcc tgg cct tgg caa get tec ctt cag tat gat aac 624 

Ala Pro Lys Ala Ala Trp Pro Trp Gin Ala Ser Leu Gin Tyr Asp Asn 

195 200 205 

ate cat cag tgt ggg gcc acc ttg att agt aac aca tgg ctt gtc act 

He His Gin Cys Gly Ala Thr Leu He Ser Asn Thr Trp Leu Val Thr 

210 215 220 



672 



720 



768 



816 



864 



912 



wo 02/095007 



PCT/USD2/16819 



-85- 



act gtc cac ate aca gga ttt gga gca ctt tac tat ggt ggg gaa tec 
Thr Val His lie Thr Gly Phe Gly Ala Leu Tyr Tyr Gly Gly Glu Ser 
305 310 315 320 

caa aat gat etc cga gaa gcc aga gtg aaa ate at a agt gac gat gtc 
Gin Asn Asp Leu Arg Glu Ala Arg Val Lys lie lie Ser Asp Asp Val 

325 330 335 

tgc aag caa cca cag gtg tat ggc aat gat ata aaa cct gga atg ttc 
Cys Lys Gin Pro Gin Val Tyr Gly Asn Asp He Lys Pro Gly Met Phe 
340 345 350 



<400> 43 




















Met 


Met 


Tyr 


Arg 


Thr 


Val 


Gly 


Phe 


Gly Thr Arg 


Ser 


1 




5 










10 






Pro 


Trp 


Met 


He 


Ala 


Val 


Leu 


He 


Val 


Leu 


Ser 


Leu 






20 










25 








Val 


Thr 


He 


Gly 


Leu 


Leu 


Val 


His 


Phe 


Leu 


Val 


Phe 






35 








40 










Glu 


Tyr 


Tyr 


His 


Gly 


Ser 


Phe 


Lys 


He 


Leu Asp 


Pro 


50 






55 








60 


Asn 


Phe 


Gly 


Gin 


Ser 


Asn 


Thr Tyr 


Gin 


Leu 


Lys 


Asp 


65 








70 








He 


75 




Thr 


Glu 


Asn 


Leu 


Val 


Asp 


Glu 


He 


Phe 


Asp 


Ser 








85 










90 






Asn 


Tyr 


He 


Lys 


Asn 


Gin 


Val 


Val 


Arg 


Leu 


Thr 


Pro 






100 










105 








val 


Lys 


Val 


Asp 


Val 


He 


Met 


Val 


Phe 


Gin 


Phe 


Pro 




1X5 








120 






He 




Arg 


Ala 


Val 


Arg 


Glu 


Lys 


Lys 


He 


Gin 


Ser 


Leu 


130 








135 










140 


Arg 


Asn 


Leu 


Arg 


Ala 


Leu 


Pro 


He 


Asn 


Ala 


Ser 


Ser 


145 








150 










155 


Gin 


Ala 


Met 


Ser 


Ser 


Ser 


Thr 


Gly Glu 


Leu 


Thr 


Val 








165 










170 







9S0 



1008 



1056 



1200 



1248 



1257 



tgt gcc gga tat atg gaa gga att tat gat gcc tgc agg ggt gat tct 1104 
Cys Ala Gly Tyr Met Glu Gly He Tyr Asp Ala Cys Arg Gly Asp Ser 
355 3S0 355 

ggg gga cct tta gtc aca agg gat ctg aaa gat acg tgg tat etc att 1152 
Gly Gly Pro Leu Val Thr Arg Asp Leu Lys Asp Thr Trp Tyr Leu He 
370 375 380 

gga att gta age tgg gga gat aac tgt ggt caa aag gac aag cct gga 
Glv He Val Ser Trp Gly Asp Asn Cys Gly Gin Lys Asp Lys Pro Gly 
385 390 395 400 

gtc tac aca eaa gtg act tat tac cga aac tgg att get tea aaa aca 
Val Tyr Thr Gin Val Thr Tyr Tyr Arg Asn Trp He Ala Ser Lys Thr 

405 410 415 

ggc ate taa 
Gly He * 

<210> 43 
<211> 418 
<212> PRT 
<213> Homo sapien 



15 

Thr Val Val Ala 
30 

Asp Gin Lys Lys 
45 

Gin He Asn Asn 

Leu Arg Glu Thr 
80 

Ala Trp Lys Lys 
95 

Glu Glu Asp Gly 
110 

Ser Thr Glu Gin 
125 

Asn Gin Lys He 

Val Gin Val Asn 
160 

Ala Ser Cys Gly 
175 
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Lys 


Arg 


Val 


Val 


Pro 


Leu 


Asn 


Val Asn Arg 


lie 




C^Av /^T^r TffiT 

ber \3iy vai 






180 








lob 






ion 




Ala 


Pro 


Lys 

195 


Ala 


Ala 


Trp 


Pro 


Trp Gin Ala 
200 


Ser 


Leu 


Gin Tyr Asp 
one 


Asn 


He 


His 


Gin 


Cys 


Gly 


Ala 


Thr 


Leu He Ser 


Asn 


Thr 


Trp Leu Val 


Tnr 




210 






215 












Ala 


Ala 


His 


Cys 


Phe 


Gin 


Lys 


Tyr Lys Asn 


Pro 


H:i.s 


GJ.I1 Trp 1112: 


Val 


225 








230 












540 


Ser 


Phe 


Gly 


Thr 


Lys 


He 


Asn 


Pro Pro Leu 


Met 


Lys 


Axg Asn Va.x 


Arg 








245 






250 






^ o 3 




Arg 


Phe 


He 


He 


His 


Glu 


Lys 


Tyr Arg Ser 


Ala 


Ala 


Arg t»xu iyr 


Asp 






260 








265 






^ / u 


He 


He 


Ala 


Val 
275 


Val 


Gin 


Val 


Ser 


Ser Arg Val 
280 


Thr 


Phe 


Ser Asp Asp 
285 


Ara Arq 


He 


Cys 


Leu 


Pro 


Glu 


Ala Ser Ala 


Ser 


Phe 


Gin Pro Asn 


Leu 




290 








295 






300 


Gly Gly Glu 




Thr 


Val 


His 


He 


Thr 


Gly 


Phe 


Gly Ala Leu 


Tyr 


Tyr 


Ser 


305 










310 






315 






320 


Gin 


Asn 


Asp 


Leu 


Arg 


Glu 


Ala 


Arg Val Lys 


He 


He 


Ser Asp Asp 


Val 








325 






330 






335 


Phe 


Cys 


Lys 


Gin 


Pro 


Gin 


Val 


Tyr Gly Asn Asp 


He 


Lys 


Pro Gly Met 




340 








345 






350 




Cys 


Ala Gly 


Tyr 


Met 


Glu Gly 


He Tyr Asp 


Ala 


Cys 


Arg Gly Asp 


Ser 




355 








360 






365 


He 


Gly Gly 


Pro 


Leu 


Val 


Thr Arg Asp Leu Lys 


Asp 


Thr 


Trp Tyr Leu 




370 










375 






380 




Gly 


Gly 


He 


Val 


Ser 


Trp 


Gly Asp Asn Cys Gly 


Gin 


Lys 


Asp Lys Pro 


385 








390 






395 






400 


Val 


Tyr 


Thr 


Gin 


Val 


Thr 


Tyr 


Tyr Arg Asn 


Trp 


He 


Ala Ser Lys 


Thr 






405 




410 






415 





Gly He 

<210> 44 

<211> 2130 

<212> DNA 

<213> Homo Sapien 

<220> 

<221> CDS 

<222> (0) . - . (2104) 

<223> Nucleotide sequence encoding MTSPIO, including 
MTSPIO protease domain 

<400> 44 

ata aac ctg gtt tat aca aca tct gcc 

He Asn Leu Val Tyr Thr Thr Ser Ala 

1 5 

tct gtt gtt gca gat gtc age age aac 
Ser Val Val Ala Asp Val Ser Ser Asn 
20 25 

cac ttt tgg att gtt ttt gtc atg cca 
His Phe Trp He Val Phe Val Met Pro 
35 40 

tgt gaa gac tgt gtt gcc gcc ate ttg 
Cys Glu Asp Cys Val Ala Ala He Leu 
SO 55 

ate ata aac egg acc tct gtg ggg age 



ttc tec aaa ttt tat gag cag 48 
Phe Ser Lys Phe Tyr Glu Gin 
10 15 

aac aaa ggc ggc etc ctt gtc 96 
Asn Lys Gly Gly Leu Leu Val 

30 

cgt gcc aaa ggc cac ate ttc 144 
Arg Ala Lys Gly His He Phe 
45 

aag gac tec ate cag aca age 192 
Lys Asp Ser He Gin Thr Ser 
60 

ttg cag gga ctg get gtg gac 240 
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He He Asn Arg Thr Ser Val Gly Ser Leu Gin Gly Leu Ala Val Asp 
65 70 75 80 

atg gac tct gtg gta eta aat get ggg ctt egg tea gat tac teg tea 
Met Asp Ser Val Val Leu Asn Ala Gly Leu Arg Ser Asp Tyr Ser Ser 

85 90 95 



ctg tct etc cac tac ecg etg gag att tct gea gee tea ggg agg ctg 
Leu Ser Leu His Tyr Pro Leu Glu He Ser Ala Ala Ser Gly Arg Leu 
115 120 125 



tea ate aag tec ate caa ate gaa gee gae aae tgt gte act gac tec 
Ser He Lys Ser He Gin He Glu Ala Asp Asn Cys Val Thr Asp Ser 
145 150 155 160 

Ctg ace att tac gac tec ctt ttg cec ate egg age age ate ttg tac 
Leu Thr He Tyr Asp Ser Leu Leu Pro He Arg Ser Ser He Leu Tyr 

165 170 175 



aat etc atg ttg gtg aca ttt aag tet ect cat ata egg agg etc tea 
Asn Leu Met Leu Val Thr Phe Lys Ser Pro His He Arg Arg Leu Ser 
195 200 205 

gga ate egg gCa tat ttt gag gtc att eca gaa caa aag tgt gaa aac 
Gly He Arg Ala Tyr Phe Glu Val He Pro Glu Gin Lys Cys Glu Asn 
210 215 220 

aca gtg ttg gtc aaa gae ate act ggc ttt gaa ggg aaa att tea age 
Thr Val Leu Val Lys Asp He Thr Gly Phe Glu Gly Lys He Ser Ser 
225 230 235 240 

cca tat tac ecg age tac tat cct cca aaa tgc aag tgt acc tgg aaa 
Pro Tyr Tyr Pro Ser Tyr Tyr Pro Pro Lys Cys Lye Cys Thr Trp Lys 

245 250 255 

ttt cag act tct eta tea act ctt ggc ata gea ctg aaa ttc tat aac 
Phe Gin Thr Ser Leu Ser Thr Leu Gly He Ala Leu Lys Phe Tyr Asn 
260 265 270 

tat tea ata ace aag aag agt atg aaa ggc tgt gag cat gga tgg tgg 
Tyr Ser He Thr Lys Lys Ser Met Lys Gly Cys Glu His Gly Trp Trp 
275 280 285 

gaa att tat gag cac atg tac tgt ggc tec tac atg gat cat cag aca 
Glu He Tyr Glu His Met Tyr Cys Gly Ser Tyr Met Asp His Gin Thr 
290 295 300 

att ttt cga gtg cec age ect ctg gtt cac att cag etc cag tgc agt 
He Phe Arg Val Pro Ser Pro Leu Val His He Gin Leu Gin Cye Ser 



288 



ace ata gga tct gae aaa ggc tgc tct cag tac ttc tat gea gag cat 336 
Thr He Gly Ser Asp Lys Gly Cys Ser Gin Tyr Phe Tyr TVla Glu His 
100 105 110 



3 84 



atg tgt eac ttc aag ctg gtg gee ata gtg ggc tac etg att cgt etc 432 
Met Cys His Phe Lys Leu Val Ala He Val Gly Tyr Leu He Arg Leu 
130 135 140 



480 



528 



aga att tgt gaa cec aca aga aca tta atg tea ttt gtt tct aca aat 576 
Arg He Cys Glu Pro Thr Arg Thr Leu Met Ser Phe Val Ser Thr Asn 
180 185 190 



624 



672 



720 



768 



816 



664 



912 



960 



wo 02/095007 



PCT/US02/16819 



-88- 



305 310 315 320 

tea agg ctt tea ggc aag cca ctt ttg gca gaa tat ggc agt tac aac 1008 
Ser Arg Leu Ser Gly Lys Pro Leu Leu Ala Glu Tyr Gly Ser Tyr Asn 

325 330 335 



ate agt caa ccc tgc cct gtg gga tct ttt aga tgc tec tec ggt tta 
lie Ser Gin Pro Cys Pro Val Gly Ser Pbe Arg Cys Ser Ser Gly Leu 
340 345 -350 



gga ggc aca gac ace etg gag ggg ggt tgg ccg tgg cag gtc age etc 
Gly Gly Thr Asp Thr Leu Glu Gly Gly Trp Pro Trp Gin Val Ser Leu 
465 470 475 480 

cae ttt gtt gga tct gcc tae tgt ggt gcc tea gtc ate tec agg gag 
His Phe val Gly Ser Ala Tyr Cys Gly Ala Ser Val lie Ser Arg Glu 

485 490 495 



1056 



tgt gtc eet cag gee cag cgt ggt gat gga gta aat gac tgc ttt gat 1104 
Cys Val Pro Gin Ala Gin Arg Gly Asp Gly Val Asn Asp Cys Phe Asp 
355 360 365 

gaa agt gat gaa ctg ttt tgc gtg age cct caa cct gee tgc aat aec 1152 
Glu Ser Asp Glu Leu Phe Cys Val Ser Pro Gin Pro Ala Cys TVsn Thr 
370 375 380 

age tec ttc agg cag cat ggc cct etc ate tgt gat ggc ttc agg gac 1200 
Ser Ser Phe Arg Gin His Gly Pro Leu He Cys Asp Gly Phe Arg Asp 
385 390 395 400 

tgt gag aat ggc egg gat gag caa aac tgc act caa agt att cca tgc 
Cys Glu Asn Gly Arg Asp Glu Gin Asn Cys Thr Gin Ser He Pro Cys 

405 410 415 

aac aac aga act ttt aag tgt ggc aat gat att tgc ttt agg aaa caa 
Asn Asn Arg Thr Phe Lys Cys Gly Asn Asp He Cys Phe Arg Lys Gin 
420 425 430 

aat gca aaa tgt gat ggg aca gtg gat tgt cca gat gga agt gat gaa 1344 
Asn Ala Lys Cys Asp Gly Thr Val Asp Cys Pro Asp Gly Ser Asp Glu 
435 440 445 

gaa ggc tgc acc tgc age agg agt tec tec gee ctt cae cge ate ate 1392 
Glu Gly Cys Thr Cys Ser Airg Ser Ser Ser Ala Leu His Arg He He 
450 455 460 



1248 



1296 



1440 



1488 



tgg ctt ctt tct gca gee cac tgt ttt cat gga aac agg ctg tea gat 1536 

Trp lieu Leu Ser Ala Ala His Cys Phe His Gly Asn Arg Leu Ser Asp 
500 505 510 

ccc aca cca tgg act gca cac etc ggg atg tat gtt cag ggg aat gcc 1584 

Pro Thr Pro Trp Thr Ala His Leu Gly Met Tyr Val Gin Gly Asn Ala 
515 520 525 

aag ttt gtc tec ccg gtg aga aga att gtg gtc cac gag tac tat aac 

Lys Phe Val Ser Pro Val Arg Arg He Val Val His Glu Tyr Tyr Asn 
530 535 540 

agt cag act ttt gat tat gat att get ttg eta cag etc agt att gcc 

Ser Gin Thr Phe Asp Tyr Asp He Ala Leu Leu Gin Leu Ser He Ala 
545 550 555 560 



1632 



1680 



wo 02/095007 



PCTrtJS02/16819 



-89- 



tgg cct gag acc ctg aaa cag etc att cag cca ata tgc att cct ccc 172 8 

Trp Pro Glu Thr Leu Lys Gin Leu lie Gin Pro lie Cys lie Pro Pro 

565 570 575 

act ggt cag aga gtt cgc agt ggg gag aag tgc tgg gta act ggc tgg 1776 
Thr Gly Gin Arg Val Arg Ser Gly Glu Lys Cys Trp Val Thr Gly Trp 
580 585 590 

ggg cga aga cac gaa gca gat aat aaa ggc tec etc gtt ctg cag caa 1824 
Gly Arg Arg His Glu Ala Asp Asn Lys Gly Ser Leu Val Leu Gin Gin 
595 600 605 

gcg gag gta gag etc att gat caa acg etc tgt gtt tec aee tac ggg 1872 
Ala Glu Val Glu Leu He Asp Gin Thr Leu Cys Val Ser Thr Tyr Gly 
610 615 620 

ate ate act tct egg atg etc tgt gca ggc ata atg tea gge aag aga 
He He Thr Ser Arg Met Leu Cys Ala Gly He Met Ser Gly Lys Arg 
625 630 635 640 



aaaaaaaaaa aaaaaa 

<210> 45 

<211> 700 

<212> PRT 

<213> Homo Sapien 



He 


Asn 


Leu 


Val 


Tyr 


Thr 


Thr 


Ser 


Ala Phe 


1 








5 








10 


Ser 


Val 


Val 


Ala 


Asp 


Val 


Ser 


Ser 


Asn Asn 








20 










25 


His 


Phe 


Trp 


He 


Val 


Phe 


Val 


Met 


Pro Arg 






35 










40 




Cys 


Glu 


Asp 


Cys 


Val 


Ala 


Ala 


He 


Leu Lys 


50 










55 






He 


He 


Asn 


Arg 


Thr 


Ser Val Gly Ser Leu 


65 










70 








Met 


Asp 


Ser 


Val 


Val 


Leu Asn Ala Gly Leu 








85 








90 


Thr 


He 


Gly 


Ser 


Asp 


Lys 


Gly 


Cys 


Ser Gin 








100 










105 


Leu 


Ser 


Leu 


His 


Tyr 


Pro 


Leu 


Glu 


He Ser 



15 

Lys Gly Gly Leu Leu Va! 
30 

Ala Lys Gly His He Ph( 
45 

Asp Ser He Gin Thr Sei 
60 

Gin Gly Leu Ala Val As] 
75 80 
Arg Ser Asp Tyr Ser Se: 

95 

Tyr Phe Tyr Ala Glu Hii 
110 

Ala Ala Ser Gly Arg Lei 
115 " 120 125 



1920 



gat gee tgc aaa gga gat teg ggt gga cct tta tct tgt cga aga aaa 1968 
Asp Ala Cys Lys Gly Asp Ser Gly Gly Pro Leu Ser Cys Arg Arg Lys 

645 6S0 655 

agt gat gga aaa tgg att ttg act ggc att gtt age tgg gga cat gga 2016 
Ser ASP Gly Lys Trp He Leu Thr Gly He Val Ser Trp Gly Hxs Gly 
660 665 670 

tgt gga cga cca aac ttt cct ggt gtt tac aca agg gtg tea aac ttt 
Cys Gly Arg Pro Asn Phe Pro Gly Val Tyr Thr Arg Val Ser Asn Phe 
675 680 685 

gtt cec tgg att eat aaa tat gtc cct tct ctt ttg taa t tgcaaaaaaa 
Val Pro Trp He His Lys Tyr Val Pro Ser Leu Leu * 
690 695 700 



2064 



2114 



2130 
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Met Cys His Phe Lys Leu Val Ala He Val Gly Tyr Leu lie Arg Leu 

130 135 140 

Ser He Lys Ser He Gin He Glu Ala Asp Asn Cys Val Thr Asp Ser 
145 150 155 160 

Leu Thr He Tyr Asp Ser Leu Leu Pro He Arg Ser Ser He Leu Tyr 

165 170 175 

Arg He Cys Glu Pro Thr Arg Thr Leu Met Ser Phe Val Ser Thr Asn 

180 185 190 

Asn Leu Met Leu Val Thr Phe Lys Ser Pro His He Arg Arg Leu Ser 

195 200 205 

Gly He Arg Ala Tyr Phe Glu Val He Pro Glu Gin Lys Cys Glu Asn 

210 215 220 

Thr Val Leu Val Lys Asp He Thr Gly Phe Glu Gly Lys He Ser Ser 
225 230 235 240 

Pro Tyr Tyr Pro Ser Tyr Tyr Pro Pro Lys Cys Lys Cys Thr Trp Lys 

245 250 255 

Phe Gin Thr Ser Leu Ser Thr Leu Gly He Ala Leu Lys Phe Tyr Asn 

260 265 270 

Tyr Ser He Thr Lys Lys Ser Met Lys Gly Cys Glu His Gly Trp Trp 

275 280 285 

Glu He Tyr Glu His Met Tyr Cys Gly Ser Tyr Met Asp His Gin Thr 

290 295 300 

He Phe Arg Val Pro Ser Pro Leu Val His He Gin Leu Gin Cys Ser 
305 310 315 320 

Ser Arg Leu Ser Gly Lys Pro Leu Leu Ala Glu Tyr Gly Ser Tyr Asn 

325 330 335 

He Ser Gin Pro Cys Pro Val Gly Ser Phe Arg Cys Ser Ser Gly Leu 

340 345 350 

Cys Val Pro Gin Ala Gin Arg Gly Asp Gly Val Asn Asp Cys Phe Asp 

355 360 365 

Glu Ser Asp Glu Leu Phe Cys Val Ser Pro Gin Pro Ala Cys Asn Thr 

370 375 380 

Ser Ser Phe Arg Gin His Gly Pro Leu He Cys Asp Gly Phe Arg Asp 
385 390 395 400 

Cys Glu Asn Gly Arg Asp Glu Gin Asn Cys Thr Gin Ser He Pro Cys 

405 410 415 

Asn Asn Arg Thr Phe Lys Cys Gly Asn Asp He Cys Phe Arg Lys Gin 

420 425 430 

Asn Ala Lys Cys Asp Gly Thr Val Asp Cys Pro Asp Gly Ser Asp Glu 

435 440 445 

Glu Gly Cys Thr Cys Ser Arg Ser Ser Ser Ala Leu His Arg He He 

450 455 460 

Gly Gly Thr Asp Thr Leu Glu Gly Gly Trp Pro Trp Gin Val Ser Leu 
465 470 475 480 

His Phe Val Gly Ser Ala Tyr Cys Gly Ala Ser Val He Ser Arg Glu 

485 490 495 

Trp Leu Leu Ser Ala Ala His Cys Phe His Gly Asn Arg Leu Ser Asp 

500 505 510 

Pro Thr Pro Trp Thr Ala His Leu Gly Met Tyr Val Gin Gly Asn Ala 

515 520 525 

Lys Phe Val Ser Pro Val Arg Arg He Val Val His Glu Tyr Tyr Asn 

530 535 540 

Ser Gin Thr Phe Asp Tyr Asp He Ala Leu Leu Gin Leu Ser He Ala 
545 550 555 560 

Trp Pro Glu Thr Leu Lys Gin Leu He Gin Pro He Cys He Pro Pro 

565 570 575 

Thr Gly Gin Arg Val Arg Ser Gly Glu Lys Cys Trp Val Thr Gly Trp 

580 585 590 

Gly Arg Arg His Glu Ala Asp Asn Lys Gly Ser Leu Val Leu Gin Gin 

595 600 605 

Ala Glu Val Glu Leu He Asp Gin Thr Leu Cys Val Ser Thr Tyr Gly. 
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610 

lie He Thr Ser Arg Met 
625 630 
Asp Ala Cys Lye Gly Asp 

645 

Ser Asp Gly Lys Trp He 
660 

Cys Gly Arg Pro Asn Phe 
675 

Val Pro Trp He His Lys 
690 

<210> 46 
<211> 8 
<212> PRT 
<213> Artifcial sequence 

<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat : (R) -Glu (alpha- (3 -amidinobenzyl) ) 

<221> MOD_RES 
<222> 8 

<223> Alanine -therapeutic agent 
<400> 46 

Leu Arg Ala Xaa Gly Arg Ala Xaa 
1 5 

<210> 47 
<211> 8 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Quat: (R) -Glu (alpha- (3 -amidinobenzyl) ) 

<221> MOD_RES 
<222> 8 

<223> Xaa is Alanine- therapeutic agent 
<400> 47 

Leu Arg Ala Xaa Ala Arg Ala Xaa 
1 5 

<210> 46 
<211> 8 
<212> PRT 



615 620 

Leu Cys Ala Gly He Met Ser Gly Lys Arg 

635 640 
Ser Gly Gly Pro Leu Ser Cys Arg Arg Lys 

650 655 
Leu Thr Gly He Val Ser Trp Gly His Gly 

665 670 
Pro Gly Val Tyr Thr Arg Val Ser Asn Phe 

680 685 
Tyr Val Pro Ser Leu Leu 
695 700 
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<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat : (R) -Glu (alpha- (3-amidinobenzyl) ) 

<221> MOD_RES 
<222> 8 

<223> Alanine -therapeutic agent 
<400> 48 

Leu Arg Ser Xaa Gly Arg Ala Xaa 
1 5 

<210> 49 
<211> 8 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat: (R) -Glu (alpha- (S-amidinobenzyl) ) 

<221> MOD_RES 
<222> 8 

<223> Alanine -therapeutic agent 
<400> 49 

Leu Arg Ser Xaa Ala Arg Ala Xaa 
1 5 

<210> 50 
<211> 8 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<221> MOD_RES 

<222> e 

<223> I soleucine- therapeutic agent 
<400> 50 

Leu Arg Pro Arg Phe Lys lie Xaa 
1 5 
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<210> 51 
<211> 7 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<:221> ACETYIATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> I soleucine- therapeutic agent 
<400> 51 

Arg Pro Arg Phe Lys lie Xaa 
1 5 

<210> 52 
<211> 6 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> I soleucine- therapeutic agent 
<400> 52 

Pro Arg Phe Lys lie Xaa 
1 5 

<210> 53 
<211> 8 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYIATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Alanine -therapeutic agent 
<400> 53 

Leu Arg Ser Lys Ser Arg Ala Xaa 
1 5 

<2a0> 54 
<211> 7 
<212> PRT 

<213> Artifcial sequence 
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<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_IlES 
<222> 7 

<223> Alanine-therapeutic agent 
<400> 54 

Arg Ser Lys Ser Arg Ala Xaa 
1 5 

<210> 55 
<211> 6 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Alanine-therapeutic agent 
<400> 55 

Ser Lys Ser Arg Ala Xaa 
1 5 

<210> 56 
<211> 8 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> I soleucine- therapeutic agent 
<400> 56 

Leu Arg Pro Arg Phe Arg lie Xaa 
1 5 

<210> 57 
<211> 7 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 



<221> ACETYIiATION 
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<222> 1 

<221> MOD_RES 
<222> 7 

<223> laoleucine- therapeutic agent 
<400> 57 

Arg Pro Arg Phe Arg lie Xaa 
1 5 

<210> 58 
<211> 6 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> € 

<223> I soleucine- therapeutic agent 
<400> 58 

Pro Arg Phe Arg lie Xaa 
1 5 

<210> 59 
<211> 8 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> I soleucine -therapeutic agent 
<400> 59 

Leu Arg Ser Arg Ser Arg Ala Xaa 
1 5 

<210> 60 
<211> 7 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD__RES 
<222> 7 
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<223> Alanine -therapeutic agent 
<400> 60 

Arg Ser Arg Ser Arg Ala Xaa 
1 5 

<210> 61 
<211> 6 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYIATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Alanine -therapeutic agent 
<400'> 61 

Ser Arg Ser Arg Ala Xaa 
1 S 

<210> 62 
<211> 8 
<212> PRT 

<:213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat : (R) -Glu (alpha- {3-amidinobenzyl) ) 

<221> MOD_RES 
<222> 6 

<223> Alanine -therapeutic agent 
<400> 62 

Leu Arg Ala Xaa Gly Arg Ala Xaa 
1 5 

<210> 63 
<211> 8 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYliATION 
<222> 1 

<221> MOD_RES 
<222> 4 
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<223> Xaa is Quat : (R) -Glu (alpha- (3-amidinobenzyl) ) 

<221> MOD RES 
<222> 8 

<223> Alanine -therapeutic agent 
<400> 63- 

Leu Arg Ala Xaa Ala Arg Ala Xaa 
1 5 

<210> 64 
<211> 8 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD__RES 
<222> 4 

<223> Xaa is Quat: (R) -Glu (alpha- (3-ainidinobenzyl) ) 

<221> MOD_RES 
<222> B 

<223> Alanine -therapeutic agent 
<400> 64 

Leu Arg Ser Xaa Gly Arg Ala Xaa 
1 5 

<210> 65 
<211> 8 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat: 

(R) -Glu (alpha - (3-amidinobenzyl) ) 

<221> MOD_RES 
<222> 8 

<223> Alanine -therapeutic agent 
<400> 65 

Leu Arg Ser Xaa Ala Arg Ala Xaa 
1 5 

<210> 66 
<211> 8 
<212> PRT 

<213> Artifcial sequence 
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<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD__RES 
<222> 8 

<223> I soleucine- therapeutic agent 
<400> 66 

Leu Arg Pro Arg Phe Lys lie Xaa 
1 5 

<210> 67 
<211> 7 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> I soleucine -therapeutic agent 
<400> 67 

Arg Pro Arg Phe Lys lie Xaa 
1 5 

<210> 68 
<211> 6 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Isoleucine- therapeutic agent 
<400> 68 

Pro Arg Phe Lys lie Xaa 
1 5 

<210> 69 
<211> 8 
<212> PRT 

<213> Artifcial sequence 
<220> 

<223> Conjugate 



<221> ACETYLATION 
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<222> 1 

<221> MOD_RES 
<222> 8 

<223> Alanine -therapeutic agent 
<400> 69 

Leu Arg Ser Lys Ser Arg Ala Xaa 
1 5 

<210> 70 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Alanine- therapeutic agent 
<400> 70 

Arg Ser Lys Ser Arg Ala Xaa 
1 5 

<210> 71 
<211> 6 
<212> PRT 

<213> Artifical sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> (1) . . . (0) 

<221> MOD_RES 
<222> 6 

<223> Alanine- therapeutic agent 
<400> 71 

Ser Lys Ser Arg Ala Xaa 
1 5 

<210> 72 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 
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<223> I SO leucine -therapeutic agent 
<400> 72 

Leu Arg Pro Arg Phe Arg lie Xaa 
1 5 

<210> 73 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> I soleucine -therapeutic agent 
<400> 73 

Arg Pro Arg Phe Arg lie Xaa 
1 5 

<210> 74 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> I soleucine -therapeutic agent 
<400> 74 

Pro Arg Phe Arg lie Xaa 
1 5 

<210> 75 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Alanine -Therapeutic Agent 
<400> 75 

Leu Arg Ser Tirg Ser Arg Ala Xaa 
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1 5 

<210> 76 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Alanine -therapeutic agent 
<400> 76 

Arg Ser Arg Ser Arg Ala Xaa 
1 5 

<210> 77 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Alanine -therapeutic agent 
<400> 77 

Ser Arg Ser Arg Ala Xaa 
1 5 

<210> 78 

<211> 5 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD RES 
<222> 1 ~ 

<223> Xaa is pyroglutamic acid 

<221> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 78 

Xaa Pro Arg Ala Xaa 
1 5 



<210> 79 
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<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is CH3S02-D-HHT : 

HHT is hexahydro tyro sol 

<221> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 79 

Xaa Gly Arg Ala Xaa 
1 5 

<210> 80 
<211> 5 
<212> PRT 

<213> Artificial' Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is N-p- tosyl-Gly 

<;22X> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 80 

Xaa Pro Arg Ala Xaa 
1 5 

<210> 81 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is Benzoyl-Val 

<221> MOD_RES 
<222> 5 

<223> Alcinine-therapeutic agent 
<400> 81 

Xaa Gly Arg Ala Xaa 
1 5 



<210> 82 
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<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is CH3S02-D-HHT: 

HHT is hexahydrotyrosyl 

<221> MOD_RES 
<222> 5 

<223> Alanine- therapeutic agent 
<400> 82 

Xaa Gly Arg Ala Xaa 
1 5 

<210> 83 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is N-aplha-Z-D-Arg : 
Z is benzyloxycarbonyl 

<221> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 83 

Xaa Gly Arg Ala Xaa 
1 5 



<210> 


84 




<211> 


5 




<212> 


PRT 




<213> 


Artificial Sequence 




<220> 






<223> 


Conj ugate 




<221> 


NOD RES 




<222> 


1 




<223> 


Xaa is pyroglutamic 


acid 


<221> 


MOD RES 




<222> 


5 




<223> 


Alanine - therapeutic 


agent 



<400> 84 

Xaa Gly Arg Ala Xaa 
1 5 
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<210> 85 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Ile 

<221> MOD_RES 
<222> 5 

<223> Alanine- therapeutic agent 
<400> 85 

Xaa Pro Axg Ala Xaa 
1 5 

<210> 86 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<22X> MOD_RES 
<222> 1 

<223> Xaa is Cbo-L- (gamma) Glu (alpha- t-BuO) : 
Cbo is carbobenzoxy 

<221> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 86 

Xaa Arg Ala Ala Xaa 
1 5 

<210> 87 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Pro 

<221> MOD_RBS 
<222> 5 

<223> Alanine- therapeutic agent 



<400> 87 

Xaa Phe Arg Ala Xaa 
1 5 
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<210> 88 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Val 

<221> MUTAGEN 
<222> 5 

<223> Alanine- therapeutic agent 
<400> 6B 

Xaa Leu Arg Ala Xaa 
1 5 

<210> 89 
<211> 6 
<212> PRT 

<213> Artificial Secfuence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is Bz-Ile: 
Bz is benzoyl 

<221> MOD_RES 
<222> 6 

<:223> Alanine -therapeutic agent 
<400> 89 

Xaa Glu Gly Arg Ala Xaa 
1 5 

<210> 90 
<211> 6 
<:212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is Bz-Ile: 
Bz is benzoyl 

<221> MOD_RES 
<222> 6 

<223> Alanine- therapeutic agent 



<400> 90 

Xaa Xaa Gly Arg Ala Xaa 
1 5 



wo 02/095007 



PCT/US02/16819 



-106- 



<210> 91 
<211> 5 
<212> PRT 

<:213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> M0D_IIES 
<222> 1 

<223> Xaa is Benzoyl -Pro 

<221> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 91 

Xaa Phe Arg Ala Xaa 
1 5 

<210> 92 

<211> 5 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Phe 

<221> MOD_RES 
<222> 2 

<223> pipecolinic acid 

<221> MOD_RES 
<222> 5 

<223> Alanine-therapeutic agent 
<400> 92 

Xaa Xaa Arg Ala Xaa 
1 5 

<210> 93 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Val 

<221> MOD_RES 
<222> 5 

<223> Alanine-therapeutic agent 
<400> 93 
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Xaa Leu Lys Ala Xaa 
1 5 

<210> 94 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Nle 

<221> MOD_RES 
<222> 2 
<223> HET: 

HHT is hexahydrotyrosyl 

<221> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 94 

Xaa Xaa Lys Ala Xaa 
1 5 

<210>" 95 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is pyroglutamic acid 

<221> MOD_RES 
<222> 7 

<223> Alanine -therapeutic agent 
<400> 95 

Xaa Arg Thr Lys Arg Ala Xaa 
1 S 

<210> 96 
<211> 6 
<212> PRT 

<213> Artificial Sec[uence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-Arg 
<221> MOD RES 
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<222> 6 

<223> Alanine -therapeutic agent 
<400> 96 

Xaa Gin Axg Arg Ala Xaa 
1 5 

<210> 97 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_EES 
<222> 1 

<223> Xaa is Boc-Gln: 

Boc is t- but oxycar bony 1 

<221> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 97 

Xaa Gly Arg Ala Xaa 
1 5 

<210> 98 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD__RES 
<222> 1 

<223> Xaa is Z-Arg: 

Z is benzyl oxycarbonyl 

<221> MOD_RES 
<222> 4 

<223> Alcinine- therapeutic agent 

<400> 98 
Xaa Arg Ala Xaa 
1 

<210> 99 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-HHT: HHT is hexahydro tyro sol 
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<221> MOD_RES 
<222> 5 

<223> Alanine- therapeutic agent 
<400> 99 

Xaa Ala Arg Ala Xaa 

1 5 

<210> 100 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-CHT: 

HHT is hexahydrotyrosyl 

<221> MOD_KES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 100 

Xaa Gly Arg Ala Xaa 
1 5 

<210> 101 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<:223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is MeS02-D-Phe 



<221> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 101 

Xaa Pro Arg Ala Xaa 
1 5 

<210> 102 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<i222> 1 
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<223> Xaa is delta-Z-D-Lys : Z is benzyl oxycarbonyl 

<221> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 102 

Xaa Pro Arg Ala Xaa 
1 5 

<210> 103 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is CH3S02-D-CHA: 

CHA is cyclohexylalanyl 

<221> MOD_RES 
<222> 2 

<223> Xaa is But -Arg 

<221> MOD_RES 
<222> 4 

<223> Alanine -therapeutic agent 

<400> 103 
Xaa Xaa Ala Xaa 
1 

<210> 104 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> m6d_RES 
<222> 6 

<223> Alanine -therapeutic agent 
<400> 104 

Arg Gin Ser Arg Ala Xaa 
1 5 

<210> lOS 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 
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<221> ACETYliATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Ala -therapeutic agent 
<400> 105 

Arg Arg Gin Ser Arg Ala Xaa 
1 5 

<210> 106 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Alanine -therapeutic agent 
<400> 106 

Leu Arg Arg Gin Ser Arg Ala Xaa 
1 5 

<210> 107 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 107 

Arg Gin Ser Arg Xaa 
1 5 

<210> 108 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_EES 
<222> 1 

<221> MOD RES 
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<222> € 

<223> Alanine- therapeutic agent 
<400> 108 

Arg Arg Gin Ser Arg Xaa 
1 5 

<210> 109 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Glycine- therapeutic agent 
<400> 109 

Leu Arg Arg Gin Ser Arg Gly Xaa 
1 5 

<210> 110 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> (0)...(0) 

<221> MOD_RES 
<222> 7 

<223> Alanine -therapeutic agent 
<400> 110 

Leu Arg Arg Gin Ser Arg Xaa 
1 5 

<210> 111 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> (0) . . , (0) 

<221> MOD_RES 
<222> 6 

<223> I soleucine- therapeutic agent 
<400> 111 
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Arg Arg Gin Ser Arg Xaa 
1 5 

<210> 112 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<222> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> I soleucine- therapeutic agent 
<400> 112 

Leu Arg Arg Gin Ser Arg Ala Xaa 
. 1 5 

<210> 113 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<222> Conjugate 

<221> ACETYIiATION 
<222> (1) - . . (0) 

<221> MOD_RES 

<223> Xaa is Quat : (R) -Glu (Alpha- O-amidinobenzyl) 

<221> MOD_RES 
<222> 8 

<22 3> Leucine -therapeutic agent 
<400> 113 

Leu Arg Ala Xaa Gly Arg Ser Xaa 
1 5 

<210> 114 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RSS 

<222> 4 . . , * V 

<223> Xaa is Quat: (R) -Glu (alpha- (3-araidinobenzyl) ) 

<221> MOD RES 
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<222> 8 

<223> Leucine -therapeutic agent 
<400> 114 

Leu Arg Ala Xaa Ala Arg Ser Xaa 
1 5 

<210> 115 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat : (R) -Glu (alpha- (3-amidinobenzyl) ) 

<221> MOD_RES 
<222> 8 

<223> Leucine- therapeutic agent 
<400> 115 

Leu Arg Ser Xaa Gly Arg Ser Xaa 
1 5 

<210> 116 
<211> B 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat: (R) -Glu (alpha- {3-amidinobenzyl) ) 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 116 

Leu Arg Ser Xaa Ala Arg Ser Xaa 
1 5 

<210> 117 
<211> 8 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Conjugate 
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<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 117 

Leu Arg Pro Arg Phe Lys Ser Xaa 
1 5 

<210> 118 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine- therapeutic agent 
<400> 118 

Arg Pro Arg Phe Lys Ser Xaa 
1 5 

<210> 119 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine- therapeutic agent 
<400> 119 

Pro Arg Phe Lys Ser Xaa 
1 5 

<210> 120 
<211> 8 
<212> PRT 

<:213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD RES 
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<222> 8 

<223> Leucine- therapeutic agent 
<400> 120 

Leu Arg Ser Lys Ser Arg Ser Xaa 
1 5 

<210> 121 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 121 

Arg Ser Lys Ser Arg Ser Xaa 
1 5 

<210> 122 
<211> 6 
<212> PRT 

<213> Aritificial sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 122 

Ser Lys Ser Arg Ser Xaa 
1 5 

<210> 123 

<211> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 



<400> 123 
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Iieu Arg Pro Arg Phe Arg Ser Xaa 
1 5 

<210> 124 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> (0) . . . (0) 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 124 

Arg Pro Arg Phe Arg Ser Xaa 

1 5 

<210> 125 , 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 125 

Pro Arg Phe Arg Ser Xaa 
1 5 

<210> 126 

<211> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> (0) . . . (0) 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 126 

Leu Arg Ser Arg Ser Arg Ser Xaa 
1 5 



<210> 127 
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<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 127 

Arg Ser Arg Ser Arg Ser Xaa 
1 5 

<210> 128 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<22i> ACETYLATION 
<222> 1 

<221> HOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 128 

Ser Arg Ser Arg Ser Xaa 
1 5 

<210> 129 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> (0) . . . (0) 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat: (R) -Glu (alpha- O-amidinobenzyl) ) 

<221> MOD_RES 
<222> 8 

<223> Leucine- therapeutic agent 
<400> 129 

Leu Arg Ala Xaa Gly Arg Ser Xaa 
1 5 



<210> 130 
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<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat : (R) -Glu (alpha- (3-amidinobenzyl) ) 

<221> MOD_RES 
<222> 8 

<223> Leucine-therapeutic agent 
<400> 130 

Leu Arg Ala Xaa Ala Arg Ser Xaa 
1 5 

<210> 131 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIATION 
<222> 1 

<221> MOD_HES 
<222> 4 

<223> Xaa is Quat: <R) -Glu (alpha- (3 -amidinobenzyl) ) 

<221> MOD_RES 
<222> 8 

<223> Leucine-therapeutic agent 
<400> 131 

Leu Arg Ser Xaa Gly Arg Ser Xaa 
1 5 

<210> 132 

<211> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat: (R) -Glu (alpha- (3 -amidinobenzyl) ) 
<221> MOD RES 
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<222> 8 

<223> Leucine- therapeutic agent 
<400> 132 

Leu Arg Ser Xaa Ala Arg Ser Xaa 

1 ^ 5 

<210> 133 
<211> 8 
<212> PRT 

<213> Ar-tificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 133 

Leu Arg Pro Arg Phe Lys Ser Xaa 
1 5 

<210> 134 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 134 

Arg Pro Arg Phe Lys Ser Xaa 
1 5 

<210> 135 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<:220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 



<400> 135 
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Pro Arg Phe Lys Ser Xaa 
1 5 

<210> 136 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_R£:S 
<2227- 8 

<223> Leucine -therapeutic agent 
<400> 136 

Leu Arg Ser Lys Ser Arg Ser Xaa 
1 5 

<210> 137 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIATION 
<222> 1 

<221> MOD_EES 
<222> 7 

<223> Leucine- therapeutic agent 
<400> 137 

Arg Ser Lys Ser Arg Ser Xaa 
1 5 

<210> 138 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 138 

Ser Lys Ser T^rg Ser Xaa 
1 5 



<210> 139 
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<211> 8 

<212> PRT 

<:213> Artificial Sequence 
<:220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 139 

Leu Arg Pro Arg Phe Arg Ser Xaa 
1 5 

<:210> 140 
<:211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

- <223> Leucine -therapeutic agent 
<400> 140 

Arg Pro Arg Phe Arg Ser Xaa 
1 5 

<210> 141 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 141 

Pro Arg Phe Arg Ser Xaa 
1 5 

<210> 142 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
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<220> 

< 2 2 3 > Con j ugat e 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 142 

Leu Arg Ser Arg Ser Arg Ser Xaa 
1 5 

<210> 143 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 143 

Arg Ser Arg Ser Arg Ser Xaa 
1 5 

<210> 144 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 144 

Ser Arg Ser Arg Ser Xaa 
1 5 

<210> 145 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 
<221> MOD RES 
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<222> 1 

<223> Xaa is pyroglutamic acid 

<221> MOD_RES 
<222> e 

<223> Leucine- therapeutic agent 
<400> 145 

Xaa Pro Arg Ser Xaa 
1 S 

<210> 146 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD RES 
<222> 1 

<223> Xaa is CH3S02-D-HHT; 

HHT i3 hexahydrotyrosyl 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 146 

Xaa 61y Arg Ser Xaa 
1 5 

<210> 147 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MODJRES 
<222> 1 

<223> Xaa is n-p-tosyl-Gly 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 147 

Xaa Pro Arg Ser Xaa 
1 5 

<210> 148 
<211> 5 
<212> PRT 

<213> Aritficlal sequence 
<220> 

<223> Conjugate 
<221> MOD RES 
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<222> 1 

<223> Xaa is Benzoyl -Val 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 148 

Xaa Gly Arg Ser Xaa 
1 5 

<210> 149 
<211> 5 
-<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is CH3S02-D-HHT; 

HHT is hexahydxotyrosyl 

<221> MOD_RES 
<222> 5 

<223> Leucine- therapeutic agent 
<400> 149 

Xaa Gly Arg Ser Xaa 
1 5 

<210> 150 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD__RES 
<222> 1 

<223> Xaa is ' N- alpha- Z-D- Arg; 
Z is benzyloxycarbonyl 

<221> MOD_RES 
<222> 5 

<223> Leucine- therapeutic agent 
<400> 150 

Xaa Gly Arg Ser Xaa 
1 5 

<210> 151 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 
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<221> MOD_RES 
<222> 1 

<223> Xaa is pyroglutamic acid 

<221> MOD RES 
<222> 5 " 

<223> Leucine-therapeutic agent 
<400> 151 

Xaa Gly Arg Ser Xaa 
1 5 

<210> 152 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Ile 

<221> MOD_RES 
<222> 5 

<223> Leucine-therapeutic agent 
<400> 152 

Xaa Pro Arg Ser Xaa 
1 5 

<210> 153 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is Cbo-L- (gamma) Glu (alpha- t-BuO) ; 
Cbo is carbobenzoxy 

<221> MOD_RES 
<222> 5 

<223> Leucine-therapeutic agent 
<400> 153 

Xaa Gly Arg Ser Xaa 
1 5 

<210> 154 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 
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<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Pro 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 154 

Xaa Fhe Arg Ser Xaa 
1 5 

<210> 155 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Val 

<221> MOD_RES 
<222> 5 

<2 2 3 > Leucine - therapeut i c agent 
<400> 155 

Xaa Leu Arg Ser Xaa 
1 5 

<210> 156 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is Bz-Ile; 
Bz is benzoyl 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 156 

Xaa Glu Gly Arg Ser Xaa 
1 S 

<210> 157 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 
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<221> MOD_RES 
<222> 1 

<223> Xaa is Bz-Ile 

<221> MOD_RES 
<222> 2 

<223> Xaa is Glu (gamma -OMe) 

<221> MOD RES 
<222> 6 " 

<223> Leucine -therapeutic agent 
<400> 157 

Xaa Xaa Gly Arg Ser Xaa 
1 5 

<210> 158 
<211> S 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is benzoyle-Pro 

<221> MOD_RES 
<222> 5 

<223> lieucine -therapeutic agent 
<400> 158 

Xaa Phe Aorg Ser Xaa 
1 5 

<210> 159 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Phe 

<221> MOD_RES . 
<222> 2 

<223> Xaa is Pip is pipecolinic acid 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic acid 
<400> 159 

Xaa Xaa Arg Ser Xaa 
1 5 

<210> 160 
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<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Val 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic acid 
<400> 160 

Xaa Leu Lys Ser Xaa 
1 5 

<210> 161 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_HES 
<222> 1 

<223> Xaa is H-D-Nle 

<221> MOD_RES 
<222> 2 

<223> Xaa is HHT: hexahydrotyxosyl 

<221> MOD RES 
<222> 5 ~ 

<223> Xaa is leucine-therapeutic agent 
<400> 161 

Xaa Xaa Lys Ser Xaa 
1 5 

<:210> 162 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is pyroglutamic acid 

<221> MOD_RES 
<222> 7 

<223> Leucine-therapeutic agent 
<400> 162 

Xaa Arg Thr Lys Arg Ser Xaa 
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1 5 

<210> 163 
<2X1> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-Arg 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 163 

Xaa Gin Arg Arg Ser Xaa 
1 5 

<210> 164 
<211> 5 
c212> PRT 

<213> Artificial Sequence 
<220> 

c223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is Boc-Gln. 

<221> MOD_RES 
<222> 5 

<223> Xaa is Leucine -therapeutic agent 
<400> 164 

Xaa Gly Arg Ser Xaa 
1 S 

«:210> 165 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<:220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is Z-Arg: 

Z is benzyloxycarbonyl 

<221> MOD_RES 
<222> 4 

<223> Iieucine- therapeutic agent 



<400> 165 

Xaa Arg Ser Xaa 
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1 

<210> 166 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_EES 
<222> 1 

<223> Xaa is H-D-HHT: HHT is hexahydrotyrosyl 

<:221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 166 

Xaa Ala Arg Ser Xaa 
1 5 

<210> 167 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-CHT: CHT is hexahydrotyrosyl 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 167 

Xaa Gly Arg Ser Xaa 
1 5 

> 

<210> 168 
<211> 5 
<212> PRT 

<213> Tkrtificial Sequence 
<220> 

<223> Conjugate 

<221> MOD^RES 
<222> 1 

<223> Xaa is MeS02-dPhe 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 



<400> 168 

Xaa Pro Arg Ser Xaa 
1 5 
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<210> 169 
<211> 5 
<2X2> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is delta-Z-D-Lys : Z is benzyloxycarbonyl 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 169 

Xaa Pro Arg Ser Xaa 
1 5 

<210> 170 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<:220> 

< 2 2 3 > Con j ugat e 

<221> MOD_RES 
<222> 1 

<223> Xaa is CH3S02-D-CHA: CHA is cyclohexylalanyl 



<221> MOD__RES 
<222> 2 

<223> Xaa is But -Arg 

<221> MOD_RES 
<222> 4 

<223> Leucine -therapeutic agent 

<400> 170 
Xaa Xaa Ser Xaa 
1 

<210> 171 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 



<400> 



171 
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Arg Gin Ser Arg Ser Xaa 
1 5 

<210> 172 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 172 

Arg Arg Gin Ser Arg Ser Xaa 
1 5 

<210> 173 
<211> 8 
<212> PRT 

<213> Artificail sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 173 

Leu Arg Arg Gin Ser Arg Ser Xaa 
1 5 

<210> 174 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 174 

Arg Gin Ser Arg Xaa 
1 5 



<210> 175 
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<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 175 

Arg Arg Gin Ser Arg Xaa 
1 5 

<210> 176 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 176 

Leu Arg Arg Gin Ser Arg Ser Xaa 
1 5 

<210> 177 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 177 

Leu Arg Arg Gin Ser Arg Xaa 
1 5 

<210> 178 
<211> 6 
<212> PRT 

<:213> Artificial Sequence 
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<220> 

<223> Conjugate 

«:221> ACETYLATION 
<:222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 178 

Arg Arg Gin Ser Arg Xaa 
1 5 

<210> 179 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Leucine -therapeutic agent 
<400> 179 

Leu Arg Arg Gin Ser Arg Ser Xaa 
1 5 

<210> 180 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 180 

Arg Gin Gly Arg Ser Xaa 
1 S 

<210> 181 
<211> 6 
<21.2> PRT 

<213> Artificial Secjuence 
<220> 

<223> Conjugate 



<221> ACETYLATION 
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<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 181 

Arg Gin Ala Arg Ser Xaa 
1 5 

<210> 182 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 182 

Arg Gin Fhe Arg Ser Xaa 
1 5 

<210> 183 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 183 

Arg Ser Arg Ser Xaa 
1 5 

<210> 184 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 



<221> MOD_RES 
<222> 5 
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<223> Leucine-therapeutic agent 
<400> 184 

Arg Gly Arg Ser Xaa 
1 5 

<210> 185 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<;220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine-therapeutic agent 
<400> 185 

Arg Ala Arg Ser Xaa 
1 5 

<210> 186 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine-therapeutic agent 
<400> 186 

Arg Phe Arg Ser Xaa 
1 5 

<210> 187 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine-therapeutic agent 
<400> 187 

Gin Ser Arg Ser Xaa 
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1 5 

<210> 188 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD__RES 
<222> 5 

<223> Leucine -therapeutic agent 
<4O0> 188 

Gin Gly Arg Ser Xaa 
1 5 

<210> 189 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 189 

Gin Ala Arg Ser Xaa 
1 5 

<210> 190 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine- therapeutic agent 
<400> 190 

Gin Phe Arg Ser Xaa 
1 5 

<210> 191 
<211> 9 
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<212> PRT 

<213> Artificial Sequence 
<220> 

< 2 2 3 > Con j uga t e 

<22a.> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<:222> 4 

<223> Xaa is Quat: (R) -Glu (Alpha- O-amidinobenzyl) 

<221> MOD_RES 
<222> 9 

<223> Leucine -therapeutic agent 
<400> 191 

Leu Arg Ala Xaa Gly Arg Ser Ser Xaa 
1 5 

<210> 192 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat: (R) -Glu (alpha- (3-amidinobenzyl) ) 

<221> MOD_RES 
<222> 9 

<223> Leucine-therapeutic agent 
<400> 192 

Leu Az3 Ala Xaa Ala Arg Ser Ser Xaa 
1 5 

<210> 193 
<211> 9 
<212> PRT 

<213> Artificial Sec[uence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat: (R) -Glu (alpha- (3 -amidinobenzyl) ) 

<221> MOD_RES 
<222> 9 
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<223> Leucine- therapeutic agent 
<400> 193 

Leu Arg Ser Xaa Gly Arg Ser Ser Xaa 
1 5 

<210> 194 
<211> 9 
<212> PRT 

^213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat : (R) -Glu (alpha- (3-ainidlnobenzyl) ) 

<221> MOD_RES 
<222> 9 

<223> Leucine -therapeutic agent 
<400> 194 

Leu Arg Ser Xaa Ala Arg Ser Ser Xaa 
1 5 

<210> 195 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
*:220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Leucine- therapeutic agent 
<400> 195 

Leu Arg Pro Arg Phe Lys Ser Ser Xaa 
1 5 

<210> 196 
<211> B 
<212> PRT 

<213> Artificial Sequence 
«;220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
«:222> 8 
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<223> Leucine- therapeutic agent 
<400> 196 

Arg Pro Arg Phe Lys Ser Ser Xaa 
1 5 * 

<210> 197 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<2 2 3 > Con j ugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 197 

Pro Arg Phe Lys Ser Ser Xaa 
1 5 

<210> 198 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Leucine -therapeutic agent 
<400> 198 

Leu Arg Ser Lys Ser Arg Ser Ser Xaa 
1 5 

<210> 199 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 199 

Arg Ser Lys Ser Arg Ser Ser Xaa 
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1 5 

<210> 200 
<211> 7 
<212> PRT 

<213> Aritificial sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 200 

Ser Lys Ser Arg Ser Ser Xaa 
1 5 

<210> 201 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Leucine- therapeutic agent 
<400> 201 

Leu Arg Pro Arg Phe Arg Ser Ser Xaa 
1 5 

<210> 202 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 202 

Arg Pro Arg Phe Arg Ser Ser Xaa 
1 5 

<210> 203 
<211> 7 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 203 

Pro Arg Phe Arg Ser Ser Xaa 
1 S 

<210> 204 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> ^30D_RES 
<222> 9 

<223> Leucine -therapeutic agent 
<40a> 204 

Leu Arg Ser Arg Ser Arg Ser Ser Xaa 
1 5 

<210> 205 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 205 

Arg Ser Arg Ser Arg Ser Ser Xaa 
1 5 

<210> 206 
<211> 7 
<212> PRT 

<213> Artificial Sequence 



<220> 



wo 02/095007 



PCT/US02/16819 



-144- 



<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD__RES 
<:222> 7 

<223> Leucine -therapeutic agent 
<400> 206 

Ser Arg Ser Arg Ser Ser Xaa 
1 5 

<210> 207 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> (0) - . . (0) 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat; {R) -Glu (alpha- (3-amidinobenzyl) ) 

<221> MOD_RES 
<222> 9 

<223> Leucine -therapeutic agent 
<400> 207 

Leu Arg Ala Xaa Gly Arg Ser Ser Xaa 
1 5 

<210> 208 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<:221> ACETYLATION 
<222> 1 

<221> MOD RES 
<222> 4 "" 

<223> Xaa is Quat: (R) -Glu (alpha- (3 -amidinobenzyl) ) 

<221> MOD__RES 
<222> 9 

<223> Leucine -therapeutic agent 
<400> 208 

Leu Arg Ala Xaa Ala Arg Ser Ser Xaa 
1 5 

<210> 209 
<211> 9 



( 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat : (R) -Glu (alpha- (3 -ainidinobenzyl) ) 

<221> MOD_RES 
<222> 9 

<223> Leucine- therapeutic agent 
<400> 209 

Leu Arg Ser Xaa Gly Arg Ser Ser Xaa 
1 5 

<210> 210 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Quat: (R) -Glu (alpha- (3-amidinobenzyl) ) 

<221> MOD_RES 
<222> 9 

<223> Leucine -therapeutic agent 
<400> 210 

Leu Arg Ser Xaa Ala Arg Ser Ser Xaa 
1 5 

<210> 211 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Leucine -therapeutic agent 
<400> 211 

Leu Arg Pro Arg Phe Lye Ser Ser Xaa 
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1 5 

<210> 212 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<:220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Leucine- therapeutic agent 
<400> 212 

Arg Pro Arg Phe Lys Ser Ser Xaa 
1 5 

<210> 213 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine-tiierapeutic agent 
<400> 213 

Pro Arg Phe Lye Ser Ser Xaa 
1 5 

<210> 214 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Leucine-therapeutic agent 
<400> 214 

Leu Arg Ser Lys Ser Arg Ser Ser Xaa 
1 5 



<210> 215 
<211> 8 



wo 02/095007 



PCT/US02/16819 



-147- 



<212> PET 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
c222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 215 

Arg Ser Lys Ser Arg Ser Ser Xaa 
1 5 

<210> 216 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

«:221> ACETYLATION 
<222> 1 

<:221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 216 

Ser Lys Ser Arg Ser Ser Xaa 
1 5 

<210> 217 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD RES 
<222> 9 ~ 

<223> Leucine -therapeutic agent 
<400> 217 

Leu Arg Pro Arg Phe Arg Ser Ser Xaa 
1 5 

<210> 218 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 218 

Arg Pro Arg Phe Arg Ser Ser Xaa 
1 5 

<210> 219 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 219 

Pro Arg Phe Arg Ser Ser Xaa 
1 5 

<210> 220 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Leucine- therapeutic agent 
<400> 220 

Leu Arg Ser Arg Ser Arg Ser Ser Xaa 
1 5 

<210> 221 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 
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<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 221 

Arg Ser Arg Ser Arg Ser Ser Xaa 
1 5 

<210> 222 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 222 

Ser Arg Ser Arg Ser Ser Xaa 
1 5 

<210> 223 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is pyroglutamic acid 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<:400> 223 

Xaa Pro Arg Ser Ser Xaa 
1 5 

<210> 224 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is CH3S02-D-HHT; 

HHT is hexahydrotyroayl 

<221> MOD RES 
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<222> 6 

< 2 2 3 > Leuc ine - ther ap eut i c agent 
<400> 224 

Xaa Gly Arg Ser Ser- Xaa 
1 5 



<210> 225 
<211> 6 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Conjugate 



<221> MOD_RES 
<222> I 

<223> Xaa is n-p-tosyl-Gly 

<221> KOD_RES 
<222> 6 

<223> Leucine- therapeutic agent 
<400> 225 

Xaa Pro Arg Ser Ser Xaa 
1 5 



<210=. 226 
<211> 6 
<212> PRT 

<213> Aritficial sequence 



<220> 

<223> Conjugate 



<221> MOD_RES 
<222> 1 

<223> Xaa is Benzoyl -Val 

<221> MOD_RES 
<222> 6 

<223> Leucine- therapeutic agent 
<400> 226 

Xaa Gly Arg Ser Ser Xaa 
1 5 



<210> 227 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is CH3S02-D-HHT; 

HHT is hexahydrotyrosyl 



<221> MOD RES 
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<222> 6 

<223> Leucine -therapeutic agent 
<400> 227 

Xaa Gly Arg Ser Ser Xaa 
1 5 

<210> 228 

<;211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is N-alpha-Z-D-Arg; 
Z is benzyloxycarbonyl 

<221> MOD RES 
<222> 6 " 

<223> Leucine -therapeutic agent 
<400> 228 

Xaa Gly Arg Ser Ser Xaa 
1 5 

<210> 229 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_EES 
<222> 1 

<223> Xaa is pyroglutamic acid 

<221> MOD_RES 
<222> 6 

<223> Leucine '■therapeutic agent 
<400> 229 

Xaa Gly Arg Ser Ser Xaa 
1 5 

<210> 230 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Ile 
<221> MOD RES 
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<222> 6 

<223> Leucine -therapeutic agent 
<400> 230 

Xaa Pro Arg Ser Ser Xaa 
1 5 

<210> 231 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD RES 
<222> 1 ~ 

<223> Xaa is Cbo-L- (gamma) Glu (alpha -t-BuO) ; 
Cbo is carbobenzoxy 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 231 

Xaa Gly Arg Ser Ser Xaa 
1 5 

<210> 232 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Pro 

<221> MOD__RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 232 

Xaa Phe Arg Ser Ser Xaa 
1 5 

<210> 233 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Val 
<221> MOD RES 
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<222> 6 

<223> Leucine -therapeutic agent 
<400> 233 

Xaa Leu Arg Ser Ser Xaa 
1 S 

<210> 234 
<:211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is Bz-Ile; 
Bz is benzoyl 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 234 

Xaa Glu Gly Arg Ser Ser Xaa 
1 5 

<210> 235 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_EES 
<222> 1 

<223> Xaa is Bz-Ile 

<221> MOD_RES 
<222> 2 

<223> Xaa is Glu (ganima-OMe) 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 235 

Xaa Xaa Gly Arg Ser Ser Xaa 
1 5 

<210> 236 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD RES 
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<222> 1 

<223> Xaa is benzoyls- Pro 

<221> MOD_RES 
<222> S 

<223> Leucine -therapeutic agent 
<400> 236 

Xaa Phe Arg Ser Ser Xaa 
1 5 

<210> 237 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Phe 

<221> MOD_RES 
<222> 2 

<223> Xaa is Pip 1b pipecolinic acid 

<221> MOD_RES 
<222> 6 

<223> Leucine- therapeutic acid 
<400> 237 

Xaa Xaa Arg Ser Ser Xaa 
1 S 

<210> 238 
<211> 6 
<:212> PRT 

<213> Artificial Sequence 
<220> 

c223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Val 

<221> MOD_IlES 
<222> 6 

<223> Leucine -therapeutic acid 
<400> 238 

Xaa Leu Lys Ser Ser Xaa 
1 5 

<2X0> 239 
<211> 6 
<212> PRT 

<213> Artificial Sequence 



<220> 
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<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-Nle 

<221> MOD_RES 
<222> 2 

<223> Xaa ±a HHT: hexahydxotyrosyl 

<221> MOD_RES 
<222> 6 

<223> Xaa is leucine- therapeutic agent 
<400> 239 

Xaa Xaa Lys Ser Ser Xaa 
1 5 



<210> 


240 




<211> 


8 




<212> 


PRT 




<213> 


Artificial Sequence 




<220> 






<223> 


Con j ugate 




<221> 


MOD RES 




<222> 


X 




<223> 


Xaa is pyroglutamic 


acid 


<221> 


MOD RES 




<222> 


8 




<223> 


Leucine - therapeut i c 


agent 


<400> 


240 




Xaa Arg Thr Lys Arg Ser Ser Xaa 


1 


5 




<210> 


241 




<211> 


7 




<212> 


PRT 





<213> Artificial Seq[uence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-Arg 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 241 

Xaa Gin Arg Arg Ser Ser Xaa 
1 5 



<210> 242 
<211> G 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is Boc-Gln 

<221> MOD_RES 
<222> 6 

<223> Xaa is Leucine -therapeutic agent 
<400> 242 

Xaa Gly Arg Ser Ser Xaa 
1 5 

<210> 243 
<211> S 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<:221> MOD_RES 
<222> 1 

<223> Xaa is Z-Arg: 

Z is benzyloxycarbonyl 

<221> MOD_RES 
<222> 5 

<22Z> Leucine -therapeutic agent 
<400> 243 

Xaa Arg Ser Ser Xaa 
1 5 

<210> 244 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-HHT: HHT is hexahydrotyrosyl 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 244 

Xaa Ala Arg Ser Ser Xaa 
1 5 

<210> 245 
<211> 6 
<212> PRT 
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<2135- Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is H-D-C3T: CHT is hexahydrotyrosyl 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 245 

Xaa Gly Arg Ser Ser Xaa 
1 5 

<210> 246 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is MeS02-dPhe 

<221> MOD_RES 
<222> € 

<223> Iieucine- therapeutic agent 
<400> 246 

Xaa Pro Arg Ser Ser Xaa 
1 5 

<210> 247 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<22X> MOD_RES 
<222> 1 

<223> Xaa is delta-Z-D-Lys : Z is benzyloxycarbonyl 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 247 

Xaa Pro Arg Ser Ser Xaa 
1 5 

<210> 248 
<211> 5 
<212> PRT 

<213> Artificial Sequence 



wo 02/095007 



PCT/US02/16819 



-158- 



<220> 

<223> Conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is CH3S02-D-CHA: CHA is cyclohexylalanyl 



<221> MOD_RES 
<222> 2 

<222> Xaa is But-Arg 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 248 

Xaa Xaa Ser Ser Xaa 
1 5 

<210> 249 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<22Q> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 249 

Arg Gin Ser Arg Ser Ser Xaa 
1 5 

<210> 250 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 250 

Arg Arg Gin Ser Arg Ser Ser Xaa 
1 5 



<210> 251 
<211> 9 
<212> PRT 
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<213> Artificail sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Leucine -therapeutic agent 
<400> 251 

Leu Arg Arg Gin Ser Arg Ser Ser Xaa 
1 5 

<210> 252 
<211> 5 
<212> PRT 

<:213> Artificial Sequence 
<220> 

<:223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<:222> 5 

<223> Leucine-therapeutic agent 
<400> 252 

Arg Gin Ser Arg Xaa 
1 5 

<210> 253 

<2ii> e 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> € 

<223> Leucine-therapeutic agent 
<400> 253 

Arg Arg Gin Ser Arg Xaa 
1 5 

<210> 254 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

< 2 2 3 > Con j ugat e 
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<221> MOD_RES 
<222> 1 

<221> MOD_RES 
<222> 9 

<:223> Leucine -therapeutic agent 
<400> 254 

Leu Arg Arg Gin Ser Arg Ser Ser Xaa 
1 5 

<210> 255 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<22i> MOD__RES 
<222> 8 

<223> Leucine- therapeutic agent 
<400> 255 

Leu Arg Arg Gin Ser Arg Ser Xaa 
1 5 

<210> 256 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 256 

Arg Arg Gin Ser Arg Ser Xaa 
1 5 

<2X0> 257 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD RES 
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<222> 9 

c223> Leucine -therapeutic agent 
<4O0> 257 

Leu Arg Arg Gin Ser Arg Ser Ser Xaa 
1 5 

<210> 258 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<:221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine- therapeutic agent 
<400> 258 

Arg 61n Gly Arg Ser Ser Xaa 

1 5 

<210> 259 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 259 

Arg Gin Ala Arg Ser Ser Xaa 
1 5 

<210> 260 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic acid 
<400> 260 
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Arg Gin Phe Arg Ser Ser Xaa 
1 5 

<210> 261 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine-therapeutic agent 
<400> 261 

Arg Ser Arg Ser Ser Xaa 
1 5 

<210> 262 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine-therapeutic agent 
<400> 262 

Arg Gly Arg Ser Ser Xaa 
1 5 

<210> 263 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine-therapeutic agent 
<400> 263 

Arg Ala Arg Ser Ser Xaa 
1 5 



<210> 264 
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<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 264 

Arg Phe Arg Ser Ser Xaa 
1 5 

<210> 265 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 265 

Gin Ser Arg Ser Ser Xaa 
1 5 

<210> 266 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 266 

Gin Gly Arg Ser Ser Xaa 
1 5 

<210> 267 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> leucine -therapeutic agent 
<400> 267 

Gin Ala Arg Ser Ser Xaa 
1 5 

<210> 268 
<211> 6 
<212> PRT 

<:213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 268 

Gin Phe Arg Ser Ser Xaa 
1 5 

<210> 269 
<211> 816 
<212> DNA 
<213> Homo Sapien 

<220> 

<221> CDS 

<222> (!}... (816) 

<223> Nucleotide sequence encoding MTSP25, including 
MTSP2S protease domain 

<221> misc feature 
<222> (2487. - - (270) 

<223> Transmembrane domain encompasses amino acids 
248-270 at the C-terminus of the trypsin-like 
serine protease domain {amino acids 1-237) 

<400> 269 

att ata ggg ggc acc gaa gca caa get ggc gca tgg ccg tgg gtg gtg 48 
lie lie Gly Gly Thr Glu Ala Gin Ala Gly Ala Trp Pro Trp Val Val 
15 10 IS 

age ctg cag att aaa tat ggc cgt gtt ctt gtt cat gta tgt ggg gga 96 
Ser Leu Gin lie Lys Tyr Gly Arg Val Leu Val His Val Cys Gly Gly 
20 25 30 

acc eta gtg aga gag agg tgg gtc etc aca get gee cac tgc act aaa 144 
Thr Leu Val Arg Glu Arg Trp Val Leu Thr Ala Ala His Cye Thr Lys 
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35 40 45 

gac get age gat cct tta atg tgg aca get gtg att gga act aat aat 192 
Asp Ala Ser Asp Pro Leu Met Trp Thr Ala Val lie Gly Thr Asn Asn 
SO 55 60 

ata eat gga cgc tat cct cat acc aag aag ata aaa att aaa gca ate 240 
lie His Gly Axg Tyx Pro His Thr Lys Lye lie Lys lie Lye Ala lie 
65 70 75 80 

att att cat cca aac ttc att ttg gaa tct tat gta aat gat att gca 288 
lie He His Pro Asn Phe lie Leu Glu Ser Tyr Val Asn Asp He Ala 

85 90 95 

ctt ttt cac tta aaa aaa gca gtg agg tat aat gac tat att cag cct 336 
Leu Phe His Leu Lys Lys Ala Val Arg Tyr Asn Asp Tyr He Gin Pro 
100 105 110 

att tgc eta ect ttt gat gtt ttc eaa ate ctg gac gga aae aea aag 3 84 
He Cys Leu Pro Phe Asp Val Phe Gin He Leu Asp Gly Asn Thr Lys 
115 120 125 

tgt ttt ata agt ggc tgg gga aga aca aaa gaa gaa ggt aac get aea 432 
Cys Phe He Ser Gly Trp Gly Arg Thr Lys Glu Glu Gly Asn Ala Thr 
130 135 140 

aat att tta eaa gat gca gaa gtg eat tat att tet cga gag atg tgt 480 
Asn He Leu Gin Asp Ala Glu Val His Tyr He Ser Arg Glu Met Cys 
145 150 155 160 

aat tct gag agg agt tat ggg gga ata att cct aac act tea ttt tgt 528 
Asn Ser Glu Arg Ser Tyr Gly Gly He He Pro Asn Thr Ser Phe Cys 

165 170 175 

gca ggt gat gaa gat gga get ttt gat act tgc agg ggt gac agt ggg 576 
Ala Gly Asp Glu Asp Gly Ala Phe Asp Thr Cys Arg Gly Asp Ser Gly 
180 185 190 

gga cca tta atg tgc tac tta cca gaa tat aaa aga ttt ttt gta atg 624 
Gly Pro Leu Met Cys Tyr Leu Pro Glu Tyr Lys Arg Phe Phe Val Met 
195 200 205 

gga att acc agt tac gga cat ggc tgt ggt cga aga ggt ttt cct ggt 672 
Gly He Thr Ser Tyr Gly His Gly Cys Gly Arg Arg Gly Phe Pro Gly 
210 215 220 

gtc tat att ggg cca tec ttc tac eaa aag tgg ctg aca gag cat ttc 720 
Val Tyr He Gly Pro Ser Phe Tyr Gin Lys Trp Leu Thr Glu His Phe 
225 230 235 240 

ttc cat gca age act caa ggc ata ctt act ata aat att tta cgt ggc 768 
Phe His Ala Ser Thr Gin Gly He Leu Thr He Asn He Leu Arg Gly 

245 250 255 

cag ate etc ata get tta tgt ttt gtc ate tta eta gea aca aca taa 816 
Gin He Leu He Ala Leu Cys Phe Val He Leu Leu Ala Thr Thr * 
260 265 270 

<210> 270 
<211> 271 
<212> PRT 
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<213> Homo Sapien 



<400> 270 





lie 


Gly 


Gly 


Txir 


Glu Ala Gin Ala Gly Ala Trp Pro 


Trp 


Val 


Val 


1 








5 




10 




15 




Ser 


lieu 


Gin 


lie 


Lys 


Tyr Gly Arg Val 


Leu Val His Val 


Cys 


Gly Gly 








20 




25 




30 






Thr 


Leu 


Val 


Arcf 


Glu 


Arg Trp Val Leu 


Thr Ala Ala His 


Cys 


Thr 


Lys 






35 






40 


45 








Asp 


Ala 


Sex 


Asp 


Pro 


Leu Met Trp Thr 


Ala Val He Gly Thr 


Asn 


Asn 




50 








55 


60 








lie 


His 


CsXy 


Ar^ 


Tyr 


Pro His Thr Lys 


Lys He Lys He 


Lys 


Ala 


He 


65 










70 


75 






80 


lie 


lie 


Hxs 


Pro 


Asn 


Phe He Leu Glu Ser Tyr Val Asn 


Asp 


He 


Ala 










85 




90 




95 




Leu 


Phe 


Hxs 


Leu 


Lys 


Lys Ala Val Arg 


Tyr Asn Asp Tyr 


He 


Gin 


Pro 








100 


105 




110 






lie 


Cys 


Leu 


Pro 


Phe 


Asp Val Phe Gin 


He Leu Asp Gly Asn 


Thr 


Lys 






115 






120 


125 








Cya 


Phe 


He 


Ser 


Gly 


Trp Gly Arg Thr Lys Glu Glu Gly Asn 


Ala 


Thr 




13 0 








135 


140 








Asn 


He 


Leu 


Gin 


Asp 


Ala Glu Val His 


Tyr He Ser Arg 


Glu 


Met 


Cys 


145 








150 


155 






160 


Asxi 


Ser 


Glu 


Arg 


Ser 


Tyr Gly Gly He 


He Pro Asn Thr 


Ser 


Phe 


Cys 










165 




170 




175 




Ala Gly 


Asp 


Glu 


Asp 


Gly Ala Phe Asp Thr Cys Arg Gly Asp 


Ser 


Gly 






180 




185 




190 






Gly 


Pro 


Leu 


Met 


Cys 


Tyr Leu Pro Glu 


Tyr Lys Arg Phe 


Phe 


Val 


Met 




195 




200 


205 








Gly 


He 


Thr 


Ser 


Tyr 


Gly His Gly Cys 


Gly Arg Arg Gly 


Phe 


Pro 


Gly 




210 








215 


220 








Val 


Tyr 


He 


Gly 


Pro 


Ser Phe Tyr Gin 


Lys Trp Leu Thr 


Glu 


His 


Phe 


225 










230 


235 






240 


Phe 


His 


Ala 


Ser 


Thr 


Gin Gly He Leu 


Thr He Asn He 


Leu 


Arg 


Gly 










245 


250 




255 




Gin 


He 


Leu 


He 


Ala 


Leu Cys Phe Val 


He Leu Leu Ala 


Thr 


Thr 










260 




265 




270 







<210> 271 

<211> 8 

<212> PRT 

<213> amino acids 401-407 of SEQ ID No. 97 in WO 02/00860 

<400> 271 

Arg Lys His Leu Pro Arg Pro Ala 
1 5 

<210> 272 
<211> 228 
<212> PRT 

<213> alternative PDl of MTSP12 
<400> 272 

He Val Gly Gly Met Glu Ala Ser Pro Gly Glu Phe Pro Trp Gin Ala 

15 10 15 

Ser Leu Arg Glu Asn Lys Glu His Phe Cys Gly Ala Ala He He Asn 

20 25 30 

Ala Arg Trp Leu Val Ser Ala Ala His Cys Phe Asn Glu Phe Gin Asp 
35 40 45 
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Pro 


Thr 


Lys 


Trp 


Val 


Ala Tyr Val Gly Ala 


Thr Tyr 


Leu Ser 


Gly 


Ser 




50 






55 


60 








Glu 


Ala 


Ser 


Thr 


Val 


Arg Ala Gin Val Val 


Gin lie 


Val Lys 


His 


Pro 


65 










70 


75 






ou 


Leu 


Tyx 


Asn 


Ala 


Asp 


Thr Ala Asp Phe Asp 


Val Ala 


Val Leu 


Glu 


Leu 








85 


90 






95 




Thr 


Ser 


Pro 


Leu 


Pro 


Phe Gly Arg His lie 


Gin Pro 


Val Cys 


Leu 


fro 








100 




105 




110 






Ala 


Ala 


Thr 


His 


lie 


Phe Pro Pro Ser Lys 


Lys Cys 


Leu - lie 


Ser 


Gly 






115 






120 




125 






Trp 


Gly 


Tyr 


Leu 


Lye 


Glu Asp Phe Leu Arg 


Lye His 


Leu Pro 


Arg 


Pro 


13 0 








135 


140 








Ala 


Val 


Lys 


Pro Gly 


Val Leu Gin Lys Ala 


Thr Val 


Glu Leu 


Leu 


Asp 


145 








15 0 


155 






160 


Gin 


Ala 


Leu 


Cys 


Ala 


Ser Leu Tyr Gly His 


Ser Leu Thr Asp Arg 


Met 








165 


170 




175 




Val 


Cys 


Ala 


Gly 


Tyr 


Leu Asp Gly Lys Val Asp Ser 


Cys Gin Gly 


Asp 






180 




185 




190 






Ser 


Gly 


Gly 


Pro 


Leu 


Val Cys Glu Glu Pro 


Ser Gly Arg Phe 


Ser 


Leu 




195 






200 




205 






Ala 


Gly 


lie 


Val 


Ser 


Trp Gly lie Gly Cys Ala Glu Ala Arg Arg 


Pro 




210 








215 


220 








Gly 


Val 


Tyr 


Ala 














225 





















<210> 273 
<211> 804 
<212> DNA 
<213> Homo Sapien 

<220> 

<221> CDS 

<222> ' (1) . . • (804) 

<223> Nucleotide sequence encoding MTSP20, including 
MTSP20 protease domain 

<400> 273 

aca gca ggt ccc cag gca gga gca ccc tec oca tgg ccc tgg gag gcc 48 

Thr Ala Gly Pro Gin Ala Gly Ala Pro Ser Pro Trp Pro Trp Glu Ala 
15 10 15 

agg ctg atg cac cag gga cag ctg gcc tgt ggc gga gcc ctg gtg tea 96 
Arg Leu Met His Gin Gly Gin Leu Ala Cys Gly Gly Ala Leu Val Ser 
20 25 30 

gag gag acg gtg eta act gtt gcc cac tgc ttc att ggg cgc cag gcc 144 
Glu Glu Thr Val Leu Thr Val Ala His Cys Phe lie Gly Arg Gin Ala 
35 40 45 

cca gag gaa tgg age gta ggg ctg ggg acc aga ccg gag gag tgg ggc 192 
Pro Glu Glu Trp Ser Val Gly Leu Gly Thr Arg Pro Glu Glu Trp Gly 
50 55 60 

ctg aag cag etc ate ctg cat gga gcc tac acc cac cct gag ggg ggc 240 
Leu Lys Gin Leu lie Leu His Gly Ala Tyr Thr His Pro Glu Gly Gly 
65 70 75 80 

tac gac atg gcc etc ctg ctg ctg gcc cag cet gtg aca ctg gga gcc 2 88 

Tyr Asp Met Ala Leu Leu Leu Leu Ala Gin Pro Val Thr Leu Gly Ala 

85 90 95 



wo 02/095007 
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168- 



agc ctg egg ccc etc tgc ctg ccc tat cet gac eac cac ctg cct gat 336 
Ser Leu Arg Pro Leu Cys Leu Pro Tyr Pro Asp His His Leu Pro Asp 
lOO 105 110 

ggg gag cgt ggc tgg gtt ctg gga egg gcc cgc cca gga gca ggc ate 3B4 
Gly Glu Arg Gly Trp Val Leu Gly Arg Ala Arg Pro Gly Ala Gly lie 
115 120 125 

age tec etc cag aea gtg ccc gtg ace etc etg ggg cet agg gcc tgc 432 
Ser Ser Leu Gin Thr Val Pro Val Thr Leu Leu Gly Pro Arg Ala Cys 
130 135 140 

age egg ctg cat gca get cct ggg ggt gat ggc age cet att etg ecg 480 
Ser Arg Leu His Ala Ala Pro Gly Gly Asp Gly Ser Pro lie Leu Pro 
145 150 155 160 

ggg atg gtg tgt acc agt get gtg ggt gag ctg ccc age tgt gag ggc 528 
Gly Met Val Cys Thr Ser Ala Val Gly Glu Leu Pro Ser Cys Glu Gly 

165 170 175 

ctg tct ggg gca cca ctg gtg cat gag gtg agg ggc aca tgg ttc etg 576 
Leu Ser Gly Ala Pro Leu Val His Glu Val Arg Gly Thr Trp Phe Leu 
180 185 190 

gcc ggg ctg cac age ttc gga gat get tgc caa ggc ccc gcc agg ccg 624 
Ala Gly Leu His Ser Phe Gly Asp Ala Cys Gin Gly Pro Ala Arg Pro 
195 200 205 

gcg gte ttc acc gcg etc cet gcc tat gag gac tgg gtc age agt ttg 672 
Ala Val Phe Thr Ala Leu Pro Ala Tyr Glu Asp Trp Val Ser Ser Leu 
210 215 220 

gac tgg cag gtc tac ttc gcc gag gaa cca gag ccc gag get gag cet 720 
Asp Trp Gin Val Tyr Phe Ala Glu Glu Pro Glu Pro Glu Ala Glu Pro 
225 230 235 240 

gga age tgc ctg gcc aac atg agt atg tgg ccc egg ggc etc ctg cca 768 
Gly Ser Cys Leu Ala Asn Met Ser Met Trp Pro Arg Gly Leu Leu Pro 

245 250 255 

aac cct gcc tct cca gga ccc ttc tct etc cag tga 804 
Asn Pro Ala Ser Pro Gly Pro Phe Ser Leu Gin * 
260 265 

<210> 274 
<211> 267 
<212> PRT 
<213> Homo Sapien 

<400> 274 

Thr Ala Gly Pro Gin Ala Gly Ala Pro Ser Pro Trp Pro Trp Glu Ala 

15 10 15 

Arg Leu Met His Gin Gly Gin Leu Ala Cys Gly Gly Ala Leu Val Ser 

20 25 30 

Glu Glu Thr Val Leu Thr Val Ala His Cys Phe lie Gly Arg Gin Ala 

35 40 45 

Pro Glu Glu Trp Ser Val Gly Leu Gly Thr Arg Pro Glu Glu Trp Gly 

50 55 60 

Leu Lys Gin Leu lie Leu His Gly Ala Tyr Thr His Pro Glu Gly Gly 
65 70 75 80 
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Tyx 


Asp 


Met 


Ala 


Leu 


Leu 


Leu Leu 


Ala Gin 


Pro Val Thr 


Leu 


Gly 


Ala 






85 






90 






95 




Ser 


Leu 


Arg 


Pro 


Leu 


Cys 


Leu Pro 


Tyr Pro 


Asp His His 


Leu 


Pro 




Asp 








100 








105 




110 






Gly 


Glu 


Arg 


Gly Trp Val 


Leu Gly 


Arg Ala 


Arg Pro Gly Ala 


Gly 


lie 






115 








120 




125 








Ser 


Ser 


Leu 


Gin 


Thr 


Val 


Pro Val 


Thr Leu 


Leu Gly Pro 


Arg 


Ala 


Cys 




X30 










135 




140 








Ser 


Arg 


Leu 


His 


Ala Ala Pro Gly 


Gly Asp 


Gly Ser Pro 


lie 


Leu 


Pro 


145 








150 






155 






160 


Gly 


Met 


Val 


Cys 


Thr 


Ser 


Ala Val 


Gly Glu 


Leu Pro Ser 


Cys 


Glu 


Gly 








165 






170 






175 




Leu 


Ser 


Gly 


Ala 


Pro 


Leu 


Val His 


Glu Val 


Arg Gly Thr Trp 


Phe 


Leu 






180 








185 




190 






Ala 


Gly 


Leu 


His 


Ser 


Phe Gly Asp 


Ala Cys 


Gin Gly Pro 


Ala 


Arg 


Pro 






195 








200 




205 








Ala 


Val 


Phe 


Thr 


Ala 


Leu 


Pro Ala 


Tyr Glu 


Asp Trp Val 


Ser 


Ser 


Leu 




210 










215 




220 








Asp 


Trp 


Gin 


Val 


Tyr 


Phe 


Ala Glu 


Glu Pro 


Glu Pro Glu 


Ala 


Glu 


Pro 


225 








230 






235 






240 


Gly 


Ser 


Cys 


Leu 


Ala 


Asn 


Met Ser 


Met Trp 


Pro Arg Gly 


Leu 


Leu 


Pro 






245 






250 






255 




Asn 


Pro 


Ala 


Ser 


Pro 


Gly 


Pro Phe 


Ser Leu 


Gin 









260 265 

<210> 275 
<211> 699 
<212> DNA 
<213> Homo Sapien 

<220> 

<221> CDS 

<222> (1) . . . (699) 

<223> Nucleotide secpience encoding MTSP22, including 
(irSP22 protease domain 

<400> 275 

att gtg aat gga aaa age tec ctg gag ggg gca tgg cca tgg cag gcc 48 

lie Val Asn Gly Lys Ser Ser Leu Glu Gly Ala Trp Pro Trp Gin Ala 
IS 10 15 

age atg caa tgg aaa gge egt cac tac tgt gga gcc tct ctg ate age 96 
Ser Met Gin Trp Lys Gly Arg His Tyr Cys Gly Ala Ser Leu lie Ser 
20 25 30 

age agg tgg eta tta tct gca get cac tgc ttt get aag aaa aat aat 144 
Ser Arg Trp Leu Leu Ser Ala Ala His Cys Phe Ala Lys Lys Asn Asn 
35 40 45 

tea aaa gat tgg act gtc aac ttt gga gtt gta gta aat aaa cca tat 192 
Ser Lys Asp Trp Thr Val Asn Phe Gly Val Val Val Asn Lys Pro Tyr 
50 55 60 

atg aca egg aaa gtc caa aac att att ttt cat gaa aat tat age agt 240 
Met Thr Arg Lye Val Gin Asn lie lie Phe His Glu Asn Tyr Ser Ser 
65 70 75 80 

cet ggg ctt cat gat gat att gcc ctt gtg cag ett get gaa gaa gtt 288 
Pro Gly Leu His Asp Asp lie Ala Leu Val Gin Leu Ala Glu Glu Val 

85 90 95 
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tct ttt aca gag tac att cgt aag att tgt ctt cct gaa gcc aaa atg 336 
Ser Phe Thr Glu Tyr lie Arg Lys lie Cys Leu Pro Glu Ala Lys Met 
100 105 110 

aag etc tea gaa aat gac aat gtt gta gtt aca ggt tgg gga aca ctt 3 84 

Lys Leu Ser Glu Asn Asp Asn Val Val Val Thr Gly Trp Gly Thr Leu 
115 120 125 

tat atg aat ggt tea ttt oca gtg ata ctt caa gaa gcc ttt ttg aag 432 
Tyr Met Asn Gly Ser Phe Pro Val He Leu Gin Glu Ala Phe Leu Lys 
130 135 140 

att att gac aac aaa att tgc aat gcc tea tat gca tac tct ggc tta 480 
He He Asp Asn Lys He Cys Asn Ala Ser Tyr Ala Tyr Ser Gly Leu 
145 150 155 ISO 

528 



576 



624 



672 



699 



gtg 


act 


gat 


aca 


atg 


tta 


tgt 


get 


gga 


ttt 


atg 


tea 


gga 


gaa 


get 


gat 


Val 


Thr 


Asp 


Thr 


Met 


Leu 


Cys 


Ala Gly Phe 


Met 


Ser 


Gly Glu Ala Asp 








165 










170 










175 




gca 


tgt 


cag 


aat 


gat 


tct 


ggt 


gga 


cca 


eta 


get 


tac 


eet 


gat 


tec 


aga 


Ala 


Cys 


Gin Asn Asp 


Ser 


Gly Gly 


Pro 


Leu 


Ala 


Tyr 


Pro 


Asp 


Ser 


Arg 






180 










185 










190 






aat 


ate 


tgg 


cat 


ctt 


gtt 


gga 


ata 


gta 


age 


tgg 


ggt 


gat 


gga 


tgt 


ggt 


Asn 


He 


Trp 


His 


Leu 


Val 


Gly 


He 


Val 


Ser 


Trp 


Gly 


Asp 


Gly Cys 


Gly 






195 










200 










205 








aaa 


aag 


aat 


aag 


cca 


ggt 


gtc 


tat 


act 


cga 


gtg 


act 


tct 


tat 


cgc 


aat 


Lys 


Lys 


Asn 


Lys 


Pro Gly 


Val 


Tyr 


Thr 


Arg 


Val 


Thr 


Ser 


Tyr Arg 


Asn 


210 










215 










220 










tgg 


att 


aca 


tec 


aag 


act 


gga 


etc 


tga 
















Trp 


He 


Thr 


Ser 


Lys 


Thr 


Gly Leu 


* 
















225 










230 






















<210> 276 




























<211> 232 




























<212> PRT 




























<213> Homo Sapien 
























<400> 276 




























He 


Val 


Asn 


Gly 


Lys 


Ser 


Ser 


Leu Glu Gly Ala Trp 


Pro 


Trp 


Gin 


Ala 


1 








5 










10 










15 




Ser 


Met 


Gin 


Trp 


Lys 


Gly 


Arg His 


Tyr 


Cys 


Gly Ala 


Ser 


Leu 


He 


Ser 








20 










25 










30 






Ser 


Arg 


Trp 


Leu 


Leu 


Ser 


Ala 


Ala 


His 


Cys 


Phe 


Ala 


Lys 


Lys 


Asn 


Asn 






35 










40 










45 








Ser 


Lys 


Asp 


Trp 


Thr 


Val 


Asn 


Phe 


Gly Val 


Val 


Val 


Asn 


Lys 


Pro 


Tyr 




50 










55 










60 










Met 


Thr 


Arg 


Lys 


Val 


Gin 


Asn 


He 


He 


Phe 


His 


Glu 


Asn 


Tyr 


Ser 


Ser 


65 






70 










75 










80 


Pro 


Gly 


Leu 


His 


Asp Asp 


He 


Ala 


Leu 


Val 


Gin 


Leu 


Ala 


Glu 


Glu 


Val 








85 










90 










95 




Ser 


Phe 


Thr 


Glu 


Tyr 


He 


Arg 


Lys 


He 


Cys 


Leu 


Pro 


Glu 


Ala 


Lys 


Met 








100 










105 










110 






Lys 


Leu 


Ser 


Glu 


Asn Asp 


Asn 


Val 


Val 


Val 


Thr Gly 


Trp 


Gly 


Thr 


Leu 




115 










120 










125 








Tyr 


Met 


Asn 


Gly 


Ser 


Phe 


Pro 


Val 


He 


Leu 


Gin 


Glu 


Ala 


Phe 


Leu 


Lys 


130 










135 










140 











f 
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He 


He 


Asp 


Asn 


Lys 


He Cys Asn Ala 


Ser 


Tyr 


Ala 


Tyr 


Ser 


Gly Leu 


145 










150 




155 








160 


Val 


Thr 


Asp 


Thr 


Met 


Leu Cys Ala Gly 


Phe 


Met 


Ser 


Gly Glu Ala Asp 










165 




170 










175 


Ala 


CyB 


Gin 


Asn Asp 


Ser Gly Gly Pro 


Leu 


Ala 


Tyr 


Pro 


Asp 


Ser Arg 








180 




185 










190 




Asn 


He 


Trp 


His 


Leu 


Val Gly He Val 


Ser 


Trp 


Gly 


Asp 


Gly 


Cys Gly 






195 






200 








205 






Lys 


Lys 


Asn 


Lys 


Pro 


Gly Val Tyx Thr 


Arg 


Val 


Thr 


Ser 


Tyr 


Arg Asn 




210 








215 






220 








Trp 


He 


Thr 


Ser 


Lys 


Thr Gly Leu 















<210> 277 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine-therapeutic agent 
<223> conjugate 
<400> 277 

Gly Ser Gly Arg Ser Xaa 
1 5 



<210> 278 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD__RES 
<222> 6 

<223> Xaa is Leucine-therapeutic Agent 
<223> conjugate 
<400> 278 

Gly Ser Gly Arg Ser Xaa 
1 5 



<210> 279 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 
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<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 279 

Gly Ser Gly Arg Ser Ser Xaa 
1 5 



<210> 280 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<:221> ACETYIiATION 
<222> 1 

<221> MOD_RES 

<222> 5 

<223> Leucine -therapeutic Agent 
<223> conjugate 

<400> 280 

Gly Ser Gly Arg Xaa 
1 5 



<210> 281 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> AMIDATION 
<222> 6 

<221> MOD_RES 

<222> 4 

<223> Xaa is 4-Guanidino-phenylglycine 
<223> conjugate 

<400> 281 

Gly Ser Gly Xaa Ser Leu 
1 5 



<210> 282 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<22 0> 

<221> ACETYLATION 
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<222> 1 

<221> MOD_RES 
<222> 7 

<223> Cyclohexylamine- therapeutic Agent 
<223> conjugate 
<400> 282 

Gly Ser Gly Arg Ser Ser Xaa 
1 5 



<210> 283 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATIOK 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<2 2 3 > con j ugate 
<400> 283 

Gly Ser Gly Arg Ala Ser Xaa 
1 5 



<210> 284 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> KOD_RES 
<222> 6 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 284 

Gly Ser Gly Arg Ser Xaa 
1 5 



<210> 285 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> (0) . . . (0) 
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<221> MOD_RES 
<222> 6 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 285 

Gly Thr Gly Arg Ser Xaa 
1 5 



<210> 286 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<22 3 > Succinyl -EAlanine 

<221> MOD_RES 
<222> 6 

<223> Nle- therapeutic Agent 
<223> conjugate 
<400> 286 

Ala Thr Gly Arg Ser Xaa 
1 5 



<210>" 287 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Cyclohexylalanine- therapeutic agent 
<223> conjugate 
<400> 287 

Gly Thr Gly Arg Ser Xaa 
1 5 



<210> 288 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> X 
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<221> MOD_RES 
<222> 2 

<223> Xaa is Homoserine 

<221> MOD_RES 
<222> 6 

<223> Nle -Therapeutic AgentNle 
<223> conjugate 
<400> 288 

Gly Xaa Gly Arg Ser Xaa 
1 5 



<210> 289 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<223> Xaa is D Serine 

<221> MOD_RES 
<222> 6 

<223> Alcmine-therapeutic Agent 
<223> conjugate 
<400> 289 

Gly Xaa Ala Arg Ser Xaa 
1 5 



<210> 290 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 290 

Gly Ser Ala Arg Ser Xaa 
1 5 



<210> 291 
<211> 7 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD__RES 
<222> 7 

<223> Cyclohexylalanine- therapeutic agent 
<223> conjugate 
<4O0> 291 

Gly Ser Ala Arg Ser Ser Xaa 
1 5 



<210> 292 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 292 

Gly Ser Ala Arg Ser Ser Xaa 
1 5 



<210> 293 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 293 

Gly Ser Ala Arg Ala Ser Xaa 
1 5 



<210> 294 
<211> 6 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<221> ACETYLATION 

<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 294 

Val Ser Gly Arg Ser Xaa 
1 5 



<210> 295 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACEryiiATION 
<222> 1 

<221> MOD_RES 
•c222> 6 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 295 

Val Ser Gly Arg Ala Xaa 
1 5 



<210> 296 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine- therapeutic Agent 
<223> conjugate 
<400> 296 

Val Ser Gly Arg Ala Ser Xaa 
1 5 



<210> 297 

<211> 7 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 297 

Val Ser Gly Arg Ser Ser Xaa 
1 5 



<210> 298 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Alanine- therapeutic Agent 
<223> conjugate 
<400> 298 

Val Ser Ala Arg Met Xaa 
1 5 



<210> 299 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Xaa is Nle 

<221> MOD_RES 
<222> 6 

<223> Alanine -therapeutic Agent 
<223> conjugate 
<400> 299 

Val Ser Ala Arg Xaa Xaa 
1 5 



<210> 300 
<211> 6 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 300 

Val Ser Ala Arg Ser Xaa 
1 5 



<210> 301 
<:211> 6 
<212> PRT 

<213> Artificial Sequence 
<:220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<:222> 6 

<223> Leucine- therapeutic Agent 
<223> conjugate 
<400> 301 

Val Ser Ala Arg Ser Xaa 
1 5 



<210> 302 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD__RES 
<222> 1 

<223> Xaa is S-Methylcysteine 

<221> MOD_RES 
<222> 6 

<223> dValine- therapeutic Agent 
<223> conjugate 



<400> 302 

Xaa Pro Gly Arg Val Xaa 
1 5 
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<210> 303 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is S -Methyl cysteine 

<221> MOD RES 
<222> 6 

<223> Valine- therapeutic Agent 
<223> conjugate 
<400> 303 

Xaa Pro Gly Arg Val Xaa 
1 5 



<210> 304 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is S- Me thy Icy at sine 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 304 

Xaa Pro Gly Arg Ala Xaa 
1 5 



<210> 305 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is S-Methylcysteine 
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<221> MOD__RES 
<222> 6 

<223> Leucine -therapeutic Agent 
< 2 2 3 > con j ugat e 
<400> 305 

Xaa Pro Gly Arg Ser Xaa 
1 5 



<210> 306 
<2X1> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is S- Methyl cysteine 

<221> MOD_RES 
<222> 6 

<223> Leucine- therapeutic Agent 
<223> conjugate 
<400> 306 

Xaa Pro Ala Arg Ser Xaa 
1 5 



<210> 307 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is S-Methylcysteine 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 307 

Xaa Pro Ala Arg Ala Ser Xaa 
1 5 



<210> 308 
<211> 6 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 

<222> a 

<221> MOD_RES 

<222> 1 

<223> Xaa is t- Butyl Glycine 

<221> MOD_RES 

<222> 7 

<223> Leucine -therapeutic Agent 

<223> conjugate 

<400> 308 

Xaa Pro Gly Arg Ser Xaa 
1 5 



<210> 309 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 3 

<223> Xaa is D Serine 

<221> MOD RES 
<222> 7 

<223> Alanine- therapeutic Agent 
<223> conjugate 
<400> 309 

Arg Gly Xaa Ala Arg Ser Xaa 
1 5 



<210> 310 

<211> 7 

<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 

<222> 1 

<221> MOD_RES 

<222> 7 

<223> Alanine- therapeutic Agent 



<2 2 3 > con j ugate 
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<400> 310 

Arg Gly Ser Gly Arg Ser Xaa 
1 5 



<210> 311 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 311 

Arg Gly Ser Gly Arg Ala Xaa 
1 5 



<210> 


312 


<211> 


7 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<221> 


ACETYIiATION 


<222> 


1 


<221> 


MOD RES 


<222> 


7 


<223> 


Leucine - 1 her apeut i c 


<223> 


conjugate 


<400> 


312 



Arg Gly Ser Gly Arg Ser Xaa 
1 5 



<210> 313 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Nle -therapeutic agent 
<223> conjugate 
<400> 313 
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Arg Gly Ser Gly Arg Ser Xaa 
1 S 



<210> 314 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> M0D_IIES 
<222> 7 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 314 

Arg Gly Ser Gly Arg Ala Xaa 
1 5 



<210> 315 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 315 

Arg Gly Ser Gly Arg Ser Ser Xaa 
1 5 



<210> 316 
<211> 7 
<212> PRT 

<213> Artificial Sec[uence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Cyclohexyl alanine -therapeutic agent 
<223> conjugate 
<400> 316 

Arg Gly Ser Gly Arg Ser Xaa 
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1 5 



<210> 317 
<211> 8 
<212> PRT 

<:213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Cyclohexylalanine-therapeutic agent 
<223> conjugate 
<400> 317 

2^xg Gly Ser Gly Arg Ser Ser Xaa 
1 5 



<210> 318 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIATION 
<222> 1 

<221> MOD_RES 
<:222> 7 

<223> Cyclohexylalanine- therapeutic agent 
<223> conjugate 
<400> 318 

Arg Gly Ser Ala Arg Ser Xaa 
1 5 



<210> 319 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
«:222> 1 

<221> MOD_RES 
<222> 8 

<223> Cyclohexylalanine-therapeutic agent 
<223> conjugate 



<400> 319 

Arg Gly Ser Ala Arg Ser Ser Xaa 
1 5 
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<210> 320 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Nva- therapeutic agent 
<223> conjugate 
<400> 320 

Arg Gly Ser Ala Arg Ser Xaa 
1 5 



<210> 321 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Nva-therapeutic agent 
<223> conjugate 
<400> 321 

Arg Gly Ser Ala Arg Ser Ser Xaa 
1 5 



<210> 322 
<21'1> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine- therapeutic Agent 
<223> conjugate 
<400> 322 

Arg Gly Ser Ala Arg Ser X^a 
1 5 



wo 02/095007 



PCT/US02/16819 



-187- 



<210> 


323 


<211> 


7 


<212> 


PRT 


<213> 


JUrtificial Sequence 


<220> 




<221> 


ACETYIiATION 


<222> 


1 


<221> 


MOD RES 


<222> 


2 


<223> 


Xaa is S -Methyl Cysteine 


<221> 


MOD RES 


<222> 


7 


<223> 


Val ine - therap eu t i c Agent 


<223> 


conjugate 


<400> 


323 


Arg Xaa Pro Gly Arg Val Xaa 


1 


5 



<210> 


324 


<211> 


7 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<221> 


ACETYLATION 


<222> 


1 


<221> 


MOD RES 


<222> 


2 


<223> 


Xaa is S- Methyl cysteine 


<221> 


MOD RES 


<222> 


7 


<223> 


Valine-therapeutic Agent 


<223> 


conjugate 


<400> 


324 



Arg Xaa Pro Gly Arg Val Xaa 



<210> 325 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<223> Xaa is S-Methylcyeteine 
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<221> MOD_RES 
<222> 7 

<223> Leucine- therapeutic Agent 
<223> conjugate 
<400> 325 

Arg Xaa Pro Gly Arg Ser Xaa 
1 5 



<210> 326 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 326 

Arg Leu Pro Gly Arg Ser Xaa 
1 5 



<210> 327 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> VARIANT 
<222> 7 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 327 

Arg Val Pro Gly Arg Ser Xaa 
1 5 



<210> 328 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> VARIANT 
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<222> 8 

<223> dLeucine- therapeutic Agent 
<223> conjugate 
<400> 328 

Arg Val Pro Gly Arg Ser Xaa 
1 5 



<210> 329 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<223> Xaa is Nle 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 329 

Arg Xaa Pro Gly Arg Ser Xaa 
1 5 



<210> 330 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<223> Xaa is t-Butylglycine 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 330 

Arg Xaa Pro Ala Ar^ Ser Xaa 
1 5 



<210> 331 
<211> 7 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Leucine- therapeutic Agent 
<223> conjugate 
<400> 331 

Arg Leu Pro Ala Arg Ser Xaa 
1 5 



<210> 332 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine- therapeutic Agent 
<223> conjugate 
<400> 332 

Arg Val Pro Ala Arg Ser Xaa 
1 5 



<210> 333 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<223> Xaa is Nle 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic Agent 
<223> conjugate 



<400> 333 

Arg Xaa Pro Ala Arg Ser Xaa 
1 5 
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<210> 334 
<211> 7 
<212> PRT 

<213> Artificial Secpience 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine- therapeutic Agent 
<223> conjugate 
<400> 334 

He Val Ser Gly Arg Ala Xaa 
1 5 



<210> 335 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic Agent 
<2 2 3 > con j ugate 
«:400> 335 

lie Val Ser Gly Arg Ser' Ser Xaa 
1 5 



<210> 336 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<£221> MOD RES 
<222> 8 " 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 336 

He Val Ser Gly Arg Ala Ser Xaa 
1 5 



<210> 337 
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<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Alanine- therapeutic Agent 
<223> conjugate 
<400> 337 

He Val Ser Ala Arg Met Xaa 
1 5 



<210> 338 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Xaa is Nle 

<221> MOD_RES 
<222> 7 

<223> Alanine -therapeutic Agent 
<223> conjugate 
<400> 338 

He Val Ser Ala Arg Xaa Xaa 
1 5 



<210> 339 

<211> 7 

<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 

<222> 1 

<221> MOD_RES 

<222> 7 

<223> Leucine -therapeutic Agent 

<223> conjugate 



<400> 339 

He Val Ser Ala Arg Ser Xaa 
1 5 
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<210> 340 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Nle- therapeutic Agent 
<223> conjugate 
<400> 340 

He Val Ser Ala Arg Ser Xaa 
1 5 



<210> 341 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Ijeucine- therapeutic Agent 
<223> conjugate 
<400> 341 

He Val Ser Ala Arg Ser Ser Xaa 
1 5 



<210> 342 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> VARIANT 
<222> 8 

<223> dLeucine- therapeutic Agent 
• <223> conjugate 



<400> 342 

Leu Arg Gly Ser Gly Arg Ser Xaa 
1 5 
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<210> 343 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYTtATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 343 

Leu Arg Gly Ser Gly Arg Ser Xaa 
1 5 



<210> 344 

<211> 9 

<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIATION 

<222> 1 

<221> MOD_RES 
<222> 8 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 344 

Leu Arg Gly Ser Gly Arg Ser Ser Xaa 
1 5 



<:210> 345 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATIOK 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Cyclohexylalanine- therapeutic agent 
<223> conjugate 
<400> 345 

Leu Arg Gly Ser Gly Arg Ser Ser Xaa 
1 5 



<210> 346 
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c211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Leucine- therapeutic Agent 
<223> conjugate 
<400> 346 

Leu Arg Gly Ser Ala Axg Ser Ser Xaa 
'1 5 



<210> 347 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> e 

<223> Leucine- therapeutic Agent 
<223> conjugate 
<400> 347 

Leu Arg Gly Ser Ala Arg Ser Xaa 
1 5 



<210> 348 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Cyclohexylalanine- therapeutic agent 
<223> conjugate 
<400> 348 

Leu Arg Gly Ser Ala Arg Ser Ser Xaa 
1 B 



<210> 349 
<211> 9 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Nva- therapeutic agent 
<223> conjugate 
<400> 349 

Leu Arg Gly Ser Ala Arg Ser Ser Xaa 
1 5 



<210> 350 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 350 

Leu Arg Gly Ser Ala Arg Ser Ser Xaa 
1 5 



<210> 351 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 351 

Val He Val Ser Gly Arg Ala Xaa 
1 5 



<:210> 352 
<211> 8 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<221> ACETYIATION 
<222> 1 

<221> MOD_RSS 
<222> 9 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 352 

Val lie Val Ser Ala Arg Ser Xaa 
1 5 



<210> 353 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 353 

Val lie Val Ser Gly Arg Ser Ser Xaa 
1 5 



<210> 354 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Alanine -therapeutic Agent 
<223> conjugate 
<400> 354 

Val He Val Ser Ala Arg Met Xaa 
1 5 



<210> 355 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Xaa is Nle 

<221> MOD_RES 
<222> 8 

<223> Alanine -therapeutic Agent 
<223> conjugate 
<400> 355 

Val lie Val Ser Ala Arg Xaa Xaa 
1 5 



<210> 356 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<:221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 356 

Val lie Val Ser Ala Arg Ser Xaa 
1 5 



<210> 357 
<21X> 8 
<212> PRT 

<213> Artificial Sequence 



<220> 

<221> ACETYIiATION 
<222> 1 



<221> MOD_RES 
<222> 8 

<223> dCyclohexylalanine- therapeutic agent 
<223> conjugate 
<400> 357 

Val He Val Ser Ala Arg Ser Xaa 
1 5 



<210> 358 
<211> 8 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Cyclohexylalanine- therapeutic agent 
<223> conjugate 
<400> 358 

Val lie Val Ser Ala Arg Ser Xaa 
1 5 



<210> 359 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Cyclohexyl alanine -therapeutic agent 
<223> conjugate 
<400> 359 

Val lie Val Ser Tlla Arg Ser Ser Xaa 
1 5 



<210> 


360 


<211> 


8 


<212> 


PRT 


<213> 


Artificial Secjuence 


<220> 




<221> 


ACETYLATION 


<222> 


1 


<221> 


MOD RES 


<222> 


3 


<223> 


Xaa is S-methylcysteine 


<221> 


MOD RES 


<222> 


9 


<223> 


Valine -therapeutic Agent 


<223> 


conjugate 


<400> 


360 



Arg Arg Xaa Pro Gly Arg Val Xaa 
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<210> 361 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 3 

<223> Xaa is Nva 

<221> MOD_RES 
<222> 8 

<223> Leucine-therapeutic Agent 
<223> conjugate 
<400> 361 

Arg Arg Xaa Pro Ala Arg Ser Xaa 
1 5 



<210> 362 
<211> B 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine-therapeutic Agent 
<223> conjugate 
<400> 362 

Ser Gly Arg Ser Xaa 
1 5 



<210> 363 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine-therapeutic Agent 
<223> conjugate 
<400> 363 

Ser Gly Arg Ser Ser Xaa 
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1 5 



<210> 364 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
«:220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine-therapeutic Agent 
<223> conjugate 
<400> 364 

Ser Gly Arg Ser Ser Ser Xaa 
1 5 



<210> 365 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 365 

Ser Gly Arg Ser Xaa 
1 S 



<210> 366 
<211> 5 
<2X2> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<;222> 5 

<223> dNva- therapeutic agent 
<223> conjugate 



<400> 366 

Ser Gly Arg Ser Xaa 
1 5 
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<210> 367 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RBS 
<222> 5 

<223> Nva- therapeutic agent 
<223> conjugate 
<400> 367 

Ser Gly Arg Ser Xaa 
1 5 



<210> 368 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> (0) . . . (0) 

<221> MOD_RES 
<222> 5 

<223> Hexylglycine- therapeutic agent 
<223> conjugate 
<400> 368 

Ser Gly Arg Ser Xaa 
1 5 



<210> 3G9 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Cyclohexylalanine- therapeutic agent 
<223> conjugate 



<400> 369 

Ser Gly Arg Ser Xaa 

1 ' 5 
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<210> 370 
<2ia> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
«:222> 1 

<221> MOD_RES 

<222> 5 

<223> Homocyclohexylalanine- therapeutic agent 

<223> conjugate 

<400> 370 

Ser Gly Arg Ser Xaa 
1 5 



<210> 371 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 

<222> 6 

<223> Leucine- therapeutic Agent 

<223> conjugate 

<400> 371 

Ser Ala Arg Ser Ser Xaa 
1 5 



<210> 372 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 372 

Ser Ala Arg Ser Ser Xaa 
1 5 



<210> 373 
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<211> 5 
<2X2> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 373 

Ser Ser Arg Ser Xaa 
1 5 



<210> 374 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 ^ 

<221> MOD_RES 
<222> 5 

<223> Abu- therapeutic agent 
<223> conjugate 
<400> 374 

Thr Gly Arg Ser Xaa 
1 5 



<210> 375 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic Agent 
<2 2 3 > con j ugat e 
<400> 375 

Thr Gly Arg Ser Xaa 
1 5 



<210> 376 
<211> 5 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Nva- therapeutic agent 
<223> conjugate 
<400> 376 

Thr Gly Arg Ser Xaa 
1 5 



<210> 377 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<:221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<:223> Nle- therapeutic agent 
<223> conjugate 
tf400> 377 

Thr Gly Arg Ser Xaa 
1 5 



<210> 378 
<211> 5 
<212> PRT 

<:213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Hexylglycine- therapeutic agent 
<223> conjugate 
<400> 378 

Thr Gly Arg Ser Xaa 
1 5 



<2a0> 379 
<2X1> 5 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<221> ACETYIATION 
<222> 1 

<221> MOD__RES 
<222> 5 

<223> Cyclohexylalanine- therapeutic agent 

<223> conjugate 

<400> 379 
■ Thr Gly Arg Ser Xaa 
1 5 



<210> 380 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIjATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Homocyclohexylalouiine- therapeutic agent 
<223> conjugate 
<400> 380 

Thr Gly Arg Ser Xaa 
1 5 



<210> 381 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Abu -therapeutic agent 
<223> conjugate 
<400> 381 

Thr Gly Arg Thr Xaa 
1 5 



<210> 382 

<211> 5 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Homoserine 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 382 

Thr Gly Arg Xaa Xaa 
1 5 



<210> 383 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Abu 

<221> MOD_RES , 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 383 

Thr Gly Arg Xaa Xaa 



1 


5 


<210> 


384 


<211> 


5 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<221> 


ACETYIjATION 


<222> 


1 


<221> 


MOD RES 


<222> 


4 


<223> 


Xaa is Abu 


<221> 


MOD RES 


<222> 


5 


<223> 


Nva- therapeutic agent 


<223> 


con j ugate 
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<400> 384 

Thr Gly Arg Xaa Xaa 
1 5 



<210> 385 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 3 

<223> Xaa is 4-Guanidinophenylalanine 

<221> MOD_RES 
<222> 5 

<223> Nle-therapeutic agent 
< 2 2 3 > con j uga t e 
<400> 385 

Thr Gly Xaa Sex Xaa 

1 . 5 



<210> 386 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_IlES 
<222> 3 

<223> Xaa is 4-Giianidinophenylalanine 

<221> MOD_RES 
<222> 5 

<223> Cyclohexylalanine- therapeutic agent 
<223> conjugate 
<400> 386 

Thr Gly Xaa Ser Xaa 
1 5 



<210> 387 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<:220> 

<221> ACETYIiATION 
<222> 1 
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<221> MOD_RES 
<222> 3 

<223> Xaa is 4-Guanidinophenylalanine 

<221> MOD_RES 
<222> 4 

<223> Xaa is Abu 

<221> MOD_RES 
<222> 5 

<223> Nva- therapeutic agent 
<223> conjugate 
<400> 387 

Thr Gly Xaa Xaa Xaa 
1 5 



<210> 388 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIATION 
<222> 1 

<221> MOD_RES 
<222> 3 

<223> Xaa ±b Alloc 

<221> MOD_RES 
<222> 5 

<223> Nle -therapeutic agent 
<223> conjugate 
<400> 388 

Thr Gly Xaa Ser Xaa 
1 5 



<210> 389 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIATION 
<222> 1 

<221> MOD_RBS 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 389 

Thr Gly Lye Ser Xaa 
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<210> 390 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 3 

<223> Xaa is Homoarginine 

<221> MOD_RES 
<222> 5 

<223> Nle-therapeutic agent 
<223> conjugate 
<400> 390 

Thr Gly Xaa Ser Xaa 
1 5 



<210> 391 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is N-homoserine 

<221> MOD__RES 
<222> 5 

<223> Nle-therapeutic agent 
<223> conjugate 
<400> 391 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 392 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD RES 
<222> 1 

<223> Xaa is N-Methyloxycarbonyl threonine 

<221> MOD_RES 
<222> 5 
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<223> Nle- therapeutic agent 
<223> conjugate 
<400> 392 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 393 
<:211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is Phenyl sulfonyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle -therapeutic agent 
<223> conjugate 
<400> 393 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 394 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is Methoxyethylcarbonyl threonine 

<221> MOD_RES 
<222> S 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 394 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 395 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is Methoxydiethoxyacetyl threonine 
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<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 395 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 396 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is 4-Oxo-pentanoyl threonine 

<221> M0D_EIES 
<222> 5 

<223> Wle- therapeutic agent 
<223> conjugate 
<400> 396 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 397 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is 2-Benzo [1 , 3] dioxol-5-yl acetylthreonine 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 397 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 398 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 
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<223> Xaa is 2-Pyridin-2-yl-acetyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 398 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 399 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_HES 
<222> 1 

<223> Xaa is Benzoyl acetyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle -therapeutic agent 
<223> conjugate 
<400> 399 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 400 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is 2 -Hydroxy- 3 -phenyl propionylacetyl 
threonine 

<221> MOD_RES 
<222> 5 

<223> Nle -therapeutic agent 
<223> conjugate 
<400> 400 

Thr Gly Arg Ser Xaa 
1 5 



<210> 401 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is Methoxyacetylthreonine 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 40X 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 402 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is Phenylacetyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle -therapeutic agent 
<223> conjugate 
<400> 402 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 403 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is 3-Methoxypropionyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 403 

Thr Gly Arg Ser Xaa 
1 5 



<210> 404 
<:211> 5 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is Methoxyethoxyacetyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 404 

Thr Gly Arg Ser Xaa 
1 5 



<210> 405 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is 1-Carboxybutanoyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
< 2 2 3 > conjugate 
<400> 405 

Thr Gly Arg Ser Xaa 
1 5 



<210> 406 
<211> 5 
<212> PRT 

<213> Artificial Sec[uence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is Carboxybenzyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle -therapeutic agent 
<223> conjugate 
<400> 406 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 407 
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<211> 5 
<212> PRT 

<213> Artificial Sequence 
<:220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is Ethoxycarbonyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 407 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 408 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is BAlanine 

<221> MOD_RES 
<222> 6 

<223> Nle -therapeutic agent 
<223> conjugate 
<400> 408 

Xaa Thr Gly Arg Ser Xaa 
1 5 



<210> 409 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD__RES 
<222> 1 

<223> Xaa is Pent-4-ynoyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 409 

Xaa Gly Arg Ser Xaa 
1 5 
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<210> 410 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is Naphthaacetyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle-therapeutic agent 
<223> conjugate 
<400> 410 

Xaa Gly Axg Ser Xaa 
1 5 



<210> 411 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is Isobutyloxycarbonyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle-therapeutic agent 
<223> conjuagate 
<400> 411 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 412 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is Hydroxyacetyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle-therapeutic agent 
<223> conjugate 



<400> 412 

Xaa Gly Arg Ser Xaa 
1 5 
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<210> 413 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
«:220> 

<221> MOD_RBS 
<222> 1 

<:223> Xaa is Methoxycarboxylpropanoyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle-therapeutic agent 
<223> conjugate 
<400> 413 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 414 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is N,N-diinethyl glycine 

<221> MOD_RES 
<222> 5 

<223> Nle-therapeutic agent 
<223> conjugate 
<400> 414 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 415 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> 1 

<223> Xaa is Succinyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle-therapeutic agent 
<223> conjugate 
<400> 415 

Xaa Gly Arg Ser Xaa 
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1 5 



<210> 416 
<2H> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_IiES 
<222> 1 

<223> Xaa is Formyl threonine 

<221> MOD_RES 
<222> 5 

<223> Hie- therapeutic agent 
<223> conjugate 
<400> 416 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 417 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACBTYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> NIe- therapeutic agent 
<223> conjugate 
<400> 417 

Thr Ala Arg Ser Xaa 
1 5 



<210> 418 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIATION 
<222> 1 

<221> MOD_RES 
<222> 3 

<223> Xaa is 4-Guanidinophenylalanine 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
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<223> conjugate 
<400> 418 

Thr Ala Xaa Ser Xaa 
1 5 



<210> 419 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 4 

<223> Xaa is Abu 

<221> MOD_RES 
<222> 5 

<223> Nva- therapeutic agent 
<223> conjugate 
<400> 419 

Thr Ala Arg Xaa Xaa 
1 5 



<210> 420 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Abu- therapeutic agent 
<223> conjugate 
<400> 420 

Thr Ala Arg Ser Xaa 
1 5 



<210> 421 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD RES 
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<222> 5 

<223> Abu- therapeutic agent 
<223> conjugate 
<400> 421 

Thr Ala Ar-g Thr Xaa 
1 5 



<210> 422 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<223> Xaa is Serine methyl ester 

<221> MOD_RES 
<222> 5 

<223> Nle-therapeutic agent 
<223> conjugate 
«:400> 422 

Thr Xaa Arg Ser Xaa 

1 5 



<210> 423 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<:221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<223> Xaa is Homoserine 

<221> MOD_RES 
<222> 5 

<223> Nle-therapeutic agent 
<223> conjugate 
<400> 423 

Thr Xaa Arg Ser Xaa 
1 5 



<210> 424 
<211> 5 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<223> Xaa is 1 -Methyl histidine 

<221> MOD_RES 
<222> 5 

<223> Nle -therapeutic agent 
<223> conjugate 
<400> 424 

Thr Xaa Arg Ser Xaa 
1 5 



<210> 425 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<223> Xaa is 3 -Methyl histidine 

<221> MOD_RES 
<222> 5 

<223> Nle-therapeutic agent 
<223> conjugate 



<400> 


425 


Thr Xaa Arg Ser Xaa 


1 


5 


<210> 


426 


<211> 


5 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<221> 


ACETYLATION 


<222> 


1 


<221> 


HOD RES 


<222> 


5 


<223> 


Nle-therapeutic agent 


<223> 


conjugate 


<400> 


426 
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Thr His Arg Ser Xaa 
1 5 



<210> 427 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD RES 
<222> 2 ~ 

<223> Xaa is MeGlycine 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 427 

Thx Xaa Arg Ser Xaa 
1 B 



<210> 428 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<223> Xaa is Nva 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 



<223> conjugate 
<4qp> 428 

Thr Xaa Arg Ser Xaa 
1 5 



<210> 429 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 
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<221> MO0_RES 

<222> 2 

<223> Xaa is Nle 

<221> MOD_RES 

<222> 5 

<223> Nle -therapeutic agent 

<223> conjugate 

<400> 429 

Thr Xaa Axg Ser Xaa 
1 5 



<210> 430 

<211> 5 

<2X2> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 

<222> 1 

<221> MOD_RES 

<222> 5 

<223> Cyclohexyl alanine -therapeutic agent 

<223> conjugate 

<400> 430 

Thr Ala Arg Ser Xaa 
1 5 



<210> 431 
<211> 5 
<:212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<223> Xaa is Abu 

<221> MOD_RES 

<222> 5 

<223> Nle -therapeutic agent 

<223> conjugate 

<400> 431 

Thr Xaa Arg Ser Xaa 
1 5 



<210> 432 
<211> 5 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is 4, 4 -Dimethyl threonine 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 432 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 433 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is Homoserine 

<221> MOD_RES 
<222> 5 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 433 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 


434 


<211> 


5 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<221> 


ACETYLATION 


<222> 


1 


<221> 


MOD RES 


<222> 


1,4 


<223> 


Xaa is Homoserine 


<221> 


MOD RES 


<222> 


5 
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<223> Cyclohexyl alanine -therapeutic agent 
<223> conjugate 
<400> 434 

Xaa Gly Arg Xaa Xaa 
1 5 



<210> 435 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is Homoaerine 

<221> MOD RES 
<222> 5 

<223> Cyclohesqrlalanine -therapeutic agent 
<223> conjugate 
<400> 435 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 436 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is Hornoserine 

<221> MOD_RES 
<222> 5 

<223> Cyclohexylaleuiine- therapeutic agent 
<223> conjugate 
<400> 436 

Xaa Gly Arg Thr Xaa 
1 5 



<210> 437 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<221> ACETYIiATION 
<222> 1 

<:221> MOD_RES 
<222> 1 

<223> Xaa is Homoserine 

<221> MOD_RES 
<222> 5 

<223> Cyclohexylalanine- therapeutic agent 
<223> conjugate 
<400> 437 

Xaa Ala Arg Ser Xaa 
1 5 



<210> 439 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<:221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> S 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 438 

Asn Gly Arg Ser Xaa 
1 5 



<210> 439 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> I»eucine- therapeutic Agent 
<223> conjugate 
<400> 439 

Tyr Gly Arg Ser Ser Xaa 
1 5 



<210> 440 
<211> 5 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Cyclohexylalanine- therapeutic agent 
<223> conjugate 
<400> 440 

Tyr Gly Arg Ser Xaa 
1 5 



<210> 441 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nle- therapeutic agent 
<223> conjugate 
<400> 441 

Gin Gly Arg Ser Ser Xaa 
1 5 



<210> 442 
<211> 6 
<212> PRT 

<:213> Artificial Sequence 
<220> 

<221> ACETYTATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nva- therapeutic agent 
<223> conjugate 
<400> 442 

Gin Gly Arg Ser Ser Xaa 
1 5 



<210> 443 
<211> 4 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is N- homo serine 

<221> MOD_RES 
<222> 4 

<223> Nle- therapeutic agent 

<223> conjugate 

<400> 443 
Xaa Arg Ser Xaa 
1 



<210> 444 
<211> 4 
<2X2> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is N-homoserine 

<221> MOD_RES 
<222> 4 

<223> Nva- therapeutic agent 

<223> conjugate 

<400> 444 
Xaa Arg Ser Xaa 
1 



<210> 445 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> BliOCKED 
<222> 5 

<223> Leucine -therapeutic Agent 
<223> conjugate 



<400> 445 
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Gln Gly Arg Ser Xaa 
1 5 



<210> 446 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic Agent 
<223> conjugate 
<400> 446 

Gin Gly Arg Ser Ser Xaa 
1 5 



<210> 447 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic Agent 
<222> conjugate 
<400> 447 

Gin Gly Arg Ala Ser Xaa 
1 5 



<210> 448 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine- therapeutic Agent 
< 2 2 3 > con j ugat e 
<400> 448 

Asn Gly Arg Ser Ser Xaa 
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1 5 



<210> 449 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nleucine- therapeutic agent 
<223> conjugate 
<400> 449 

Gin Gly Arg Ser Ser Xaa 
1 5 



<210> 450 
<211> 6 
<212> PRT 

<:213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD RES 
<222> 6 ~ 

<223> Nvaline- therapeutic agent 
<223 > conj ugate 
<400> 450 

Gin Gly Arg Ser Ser Xaa 
1 5 



<210> 451 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Cyclohexylaleuiine- therapeutic agent 
<223> conjugate 



c400> 451 

Gin Gly Arg Ser Ser Xaa 
1 5 
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<210> 452 
<211> 6 
<212> PRT 

<213> Artificial Seq^ience 
<220> 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> € 

<223> Allyl- therapeutic agent 

<223> conjugate 

<221> MOD_RES 
<222> 5 

<223> Xaa is dSerine 
<400> 452 

Gin Gly Arg Ser Xaa Xaa 
1 5 



<210> 453 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Allyl -therapeutic agent 
<400> 453 

Gin Gly Arg Ser Ser Xaa 
1 5 



<210> 454 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> (0)...(0) 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 454 
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Gin. Ala Arg Ser Xaa 
1 5 



<210> 455 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 455 

Gin Ala Arg Ser Ser Xaa 
1 5 



<210> 456 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 456 

Gin Ser Arg Ser Xaa 
1 5 



<210> 457 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nva- therapeutic agent 
<400> 457 

Gin Ser Arg Ser Ser Xaa 
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1 5 



<210> 458 

<211> 6 

<212> PRT 

<213> T^tificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Cyclohexylalanine- therapeutic agent 
<40O> 458 

Gin Ser Arg Ser Ser Xaa 
1 5 



<210> 459 
<211> 6 
<212> PRT 

<:213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<:222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 459 

Gin Ser Arg Ser Ser Xaa 
1 5 



<210> 460 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<2 2 3 > con j uga t e 

<221> ACETYIATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 460 

Gin Thr Arg Ser Ser Xaa 
1 5 
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<210> 461 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<223> Xaa is Aib 

<221> MOD__RES 
<222> 6 

<223> Cyclohexyl alanine -therapeutic agent 
<400> 461 

Gin Xaa Arg Ser Ser Xaa 
1 5 



<210> 462 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<:223> Xaa is Aib 

<221> MOD__RES 
<222> 6 

<22 3> Leucine -therapeutic agent 
<400> 462 

Gin Xaa Arg Ser Ser Xaa 
1 5 



<210> 463 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> (0) . . . (0) 

<221> MOD RES 
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<222> 2 

<223> Xaa is Abu 

<221> MOD_RES ^ 
<:222> 6 

<223> Cyclohexyl alanine -therapeutic agent 
<400> 463 

Gin Xaa Arg Ser Ser Xaa 
1 5 



<210> 464 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<223> Xaa is Abu 

<221> MOD_RES 
<222> € 

<223> Leucine -therapeutic agent 
<400> 464 

Gin Xaa Axrg Ser Ser Xaa 
1 5 



<210> 465 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<223> Xaa is Cyclohexylalanine 

<221> MOD_RES 
<222> 6 

<223> Cyclohexylalanine- therapeutic agent 
<400> 465 

Gin Xaa Arg Ser Ser Xaa 
1 5 



<210> 466 
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<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATIOK 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 466 

Gin Phe Arg Ser Xaa 
1 5 



<210> 467 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 467 

Gin Phe Arg Ser Ser Xaa 
1 5 



<210> 46B 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETyiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 468 

Gin Tyr Arg Ser Ser Xaa 
1 5 



<210> 469 
<211> 5 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 469 

Arg Gly Arg Ser Xaa 
1 5 



<210> 470 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine-therapeutic agent 
<400> 470 

Arg Gly Arg Ser Ser Xaa 
1 5 



<210:> 471 
<211> 6 
<212i. PRT 

<213> Artificial Sequence 
<220> 

<2 2 3 > con j ugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Cyclohexylalanine- therapeutic agent 
<400> 471 

Arg Gly Arg Ser Ser Xaa 
1 5 



<210> 472 
<211> 5 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<:221> MOD_RES 
<222> 5 

<223> Cyclohexylalanine-therapeutic agent 
<400> 472 

Arg Gly Arg Ser Xaa 
1 5 



<210> 473 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 473 

Arg Ala Arg Ser Xaa 
1 5 



<210> 474 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 474 

Arg Ala Arg Ser Ser Xaa 
1 5 



<210> 475 

<211> 5 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 475 

Arg Ser Arg Ser Xaa 
1 5 



<210> 476 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD RES 
<222> 6 ~ 

<223> Leucine -therapeutic agent 
<400> 476 

Arg Ser Arg Ser Ser Xaa 
1 5 



<210> 477 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223i. Cyclohexylalanine- therapeutic agent 
<;400> 477 

Arg Ser Arg Ser Xaa 
1 5 



<210> 47B 
<211> 6 
<212> PRT 

<213> Artificial Sequence 



<220:» 
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<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Cyclohexylalanine- therapeutic agent 
<400> 478 

Arg Ser Arg Ser Ser Xaa 
1 5 



<210> 479 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETyLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine- therapeutic agent 
<400> 479 

Arg Phe Arg Ser Xaa 
1 5 



<210> 480 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Cyclohexyl alanine -therapeutic agent 
<400> 480 

Arg Phe Arg Ser Xaa 
1 5 



<210> 481 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 
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<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 481 

Tyr Gly Arg Ser Ser Xaa 
1 5 



<210> 482 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

< 2 2 3 > con j ugate 

<221> ACETYIATION 
<222> 1 

<221> MOD_RES 
«:222> 1 

<223> Xaa is S-Dioxomethionine 

<221> MOD_RES 
<222> 5 

<223> Leucine- therapeutic agent 
<400> 482 

Xaa Ser Arg Ser Xaa 
1 5 



<210> 483 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> MOD_RES 
<222> 1 

<2 2 3 > Xaa i s Methoxycarbony 1 - ( alpha - ( 3 - cyanobenzy 1 ) ) 
glutamic acid- delta-methyl ester 

<221> AMIDATION 
<222> S 

<400> 483 

Xaa Gly Arg Ser Leu 
1 5 



<210> 484 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is Methoxycarbonyl- (alpha- (3-amidinobenzyl) ) 
glutamic acid -delta-methyl ester 

<221> AMIDATION 
<222> 5 

<400> 484 

Xaa Gly Arg Ser Leu 
1 5 



<210> 485 
<211> S 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is Methoxycarbonyl - 

{alpha- (3-amidinobenzyl) ) glutamic acid 

<221> AMIDATION 
<222> 5 

<400> 485 

Xaa Gly Arg Ser Leu 
1 5 



<210> 486 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is Methoxycarbonyl- (alpha- (3 -Methylbenzyl) 
) glutamic acid -delta-methyl ester 

<22X> AMIDATION 
<222> 5 

<400> 486 

Xaa Gly Arg Ser Leu 
1 5 



<210> 487 
<211> 5 
<212> PRT 



wo 02/095007 



PCT/US02/16819 



-244- 



<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> MOD_RES 
<222> 1 

<:223> Xaa is Methoxycarbonyl- 

(alpha- (3-methylbenzyl) ) glutamic acid 

<221> AMIDATION 
<222> 5 

<400> 487 

Xaa Gly Arg Ser Leu 
1 5 



<210> 488 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 



<221> MOD_RES 
<222> 1 

<223> Xaa is Methoxycarbonyl- (alpha- (3-cyanobenzyl) ) 
glutamic acid-delta-methyl ester 

<221> AMIDATION 
<222> 6 



<400> 488 

Xaa Gly Arg Ser Ser Leu 
1 5 



<210> 489 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<:223> conjugate 

<221> MOD_RES 
«:222> 1 

<223> Xaa is Methoxycarbonyl- (alpha- (3 -met hylbenzyl) 
) glutamic acid -delta-methyl ester 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 



<400> 489 

Xaa Gly Arg Ser Xaa 
1 5 
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<210> 490 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is Methoxycarbonyl- (alpha- (3-cyanobenzyl) ) 
glutamic acid -delta -methyl ester 

<221> MOD_RES 
<222> 5 

<223> Leucine- therapeutic agent 
<400> 490 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 491 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 491 

Arg Gin Gly Arg Ser Xaa 
1 5 



<210> 492 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<:223> Leucine- therapeutic agent 
<400> 492 

Arg Gin Gly Arg Ser Ser Xaa 
1 5 
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<210> 493 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nle- therapeutic agent 
<400> 493 

Arg Gin Gly Arg Ser Xaa 
1 5 



<210> 494 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nva- therapeutic agent 
<400> 494 

Arg Gin Gly Arg Ser Xaa 
1 5 



<210> 495 
<211> 6 
<212> PRT 

<:213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Phenylalanine -therapeutic agent 
<400> 495 

Arg Gin Gly Arg Ser Xaa 
1 5 



<210> 496 
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<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<:223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> dLeucine- therapeutic agent 
<400> 496 

Arg Gin Gly Arg Ala Xaa 
1 5 



<210> 497 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 497 

Arg Gin Gly Arg Ala Xaa 
1 5 



<210> 498 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATIOW 
<;222> 1 

<221> MOD_RES 
<222> 6 

<223> dNle- therapeutic agent 
<400> 496 

T^g Gin Gly Arg Ala Xaa 
1 5 



<210> 499 
<211> 6 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nle- therapeutic agent 
<400> 499 

Arg Gin Gly Arg Ala Xaa 
1 5 



<210> 50Q 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nva- therapeutic agent 
<400> 500 

Arg Gin Gly Arg Ala Xaa 
1 5 



<210> 501 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Cyclohexylalanine -therapeutic agent 
<400> 501 

Arg Gin Gly Arg Ala Xaa 
1 5 



<210> 502 
<211> 6 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Phenylalanine-therapeutic agent 
<400> 502 

Arg Gin Gly Arg Ala Xaa 
1 5 



<210> 503 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> leucine -therapeutic agent 
<400> 503 

Arg Asn Gly Arg Ser Xaa 
1 5 



<210> 504 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nle- therapeutic agent 
<400> 504 

Arg Asn Gly Arg Ala Xaa 
1 5 



<210> 505 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 505 

Arg Gin Ala Arg Ser Xaa 
1 5 



<210> 506 
<211> 6 
<212> PRT 

<213> Artificial Secfuence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nle- therapeutic agent 
<400> 506 

Arg Gin Ala Arg Ser Xaa 
1 5 



<210> 507 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nva- therapeutic agent 
<400> 507 

Arg Gin Ala Arg Ser Xaa 
1 5 



<210> 508 
<211> 6 
<212> PRT 

<213> Artificial Sequence 



<220> 
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<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Cyclohexylalanine- therapeutic agent 
<400> 508 

Arg Gin Ala Arg Ser Xaa 
1 5 



<210> 509 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Cyclohexylalanine -therapeutic agent 
<400> 509 

Arg Gin Ala Arg Ser Ser Xaa 
1 5 



<210> 510 
<211> 6 
<212> PRT 

<:213> Artificial Sec[uence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nle- therapeutic agent 
<400> 510 

Arg Gin Ala Arg Thr Xaa 
1 5 



<210> 511 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<2 2 3 > con j ugat e 
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<221> ACETVLATION 
<222> 1 

<221> MOD__RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 511 

Arg Gin Ala Arg 7U.a Xaa 
1 5 



<210> 512 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nle- therapeutic agent 
<400> 512 

Arg Gin Ala Arg Ala Xaa 
1 5 



<210> 513 
<211> e 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nva- therapeutic agent 
<400> 513 

Arg Gin Ala Arg Ala Xaa 
1 5 



<210> 514 
<211> 6 
<:212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 
<221> ACETYLATION 



wo 02/095007 



PCT/US02/16819 



-253- 



<222> 1 

<221> MOD__RES 
<222> 6 

<223> Cyclohexyl alanine -therapeutic agent 
<400> 514 

Arg Gin Ala Arg Ala Xaa 
1 5 



<210> 515 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
«:220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Alanine -therapeutic agent 
<400> 515 

Arg Gin Ser Arg Ala Xaa 
1 5 



<210> 516 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 516 

Arg Gin Ser Arg Xaa 
1 5 



<210> 617 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
«:222> 1 
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<221> MOD_RES 
<222> 6 

<223> Nle- therapeutic agent 
<400> 517 

Arg Gin Ser Arg Ala Xaa 
1 5 



<210> 518 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> e 

<223> Alanine -therapeutic agent 
<400> 518 

Arg Gin Ser Arg Ala Xaa 
1 5 



<210> 5X9 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<222> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> € 

<223> Leucine -therapeutic agent 
<400> 519 

Arg Gin Ser Arg Ala Xaa 
1 5 



<210> 520 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD RES 
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<222> 6 

<22 3> Nva- therapeutic agent 
<400> 520 

Arg Gin Ser Arg Ala Xaa 

1 '5 



<210> 521 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Cyclohexylalanine- therapeutic agent 
<400> 521 

Arg Gin Ser Arg Ala Xaa 
1 5 



<210> 522 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine- therapeutic agent 
<400> 522 

Arg Gin Ser Arg Ser Ser Xaa 
1 5 



<2X0> 523 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RBS 
<222> 6 
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<223> Leucine -therapeutic agent 
<400> 523 

Arg Gin Ser Arg Ser Xaa 
1 5 



<210> 524 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> dNle- therapeutic agent 
<400> 524 

Arg Gin Ser Arg Ser Xaa 
1 5 



<210> 52S 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nle- therapeutic agent 
<400> 525 

Arg Gin Ser Arg Ser Xaa 
1 S 



<210> 526 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<;223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
^222> 6 

<223> Nva- therapeutic agent 
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<400> 526 

Arg Gin Ser Arg Ser Xaa 
1 5 



<210> 527 
<211> 6 
<:212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Allylglycine- therapeutic agent 
<400> 527 

Arg Gin Ser Arg Ser Xaa 
1 5 



<210> 528 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Cyclohexylalanine- therapeutic agent 
<40O> 528 

Arg Gin Ser Arg Ser Xaa 
1 5 



<210> 529 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

< 2 2 3 > con j uga te 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 
<223> Nle 



<400> 529 
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Arg Gin Ser Arg Thr Xaa 
1 5 



<210> 530 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<2 2 3 > con j ugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 530 

Arg Gin Thr Arg Ser Ser Xaa 
1 5 



<210> 531 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 531 

Arg Gin Thr Arg Ser Xaa 
1 5 



<210> 532 
<211> 6 
<212> PRT 

<:213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nle -therapeutic agent 
<400> 532 

Arg Asn Ser Arg Ser Xaa 
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1 5 



<210> 533 
<:211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 533 

Arg Gin Phe Arg Ser Xaa 
1 5 



<210> 534 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<22i> MOD_RES 
<222> 6 

<223> dNle- therapeutic agent 
<400> 534 

Arg Gin Phe Arg Ser Xaa 
1 5 



<210> 535 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 

<222> 1 

<221> MOD_RES 

<222> 6 

<223> Nva- therapeutic agent 



<400> 535 

Arg Gin Phe Arg Ser Xaa 
1 5 
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<210> 536 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<:221> MOD_RES 

<222> e 

<223> Nle- therapeutic agent 
<400> 536 

Arg Gin Phe Arg Ser Xaa 
1 S 



<:210> 537 
<211> 6 
<212> PRT 

-c213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Cyclohexylalanine- therapeutic agent 
<400> 537 

Arg Gin Phe Arg Ser Xaa 
1 5 



<210> 538 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 538 

Arg Gin Phe Arg Ala Xaa 
1 5 
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<210> 539 
<211> 6 
<212> PRT 

<213> Artificial Sequence . 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nle- therapeutic agent 
<400> 539 

Arg Gin Phe Arg Ala Xaa 
1 5 



<210> 540 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nva- therapeutic agent 
<400> 540 

Arg Gin Phe TUrg Ala Xaa 
1 5 



<210> 541 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<2 2 3 > con j ugat e 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Cyclohexylalanine- therapeutic agent 
<400> 541 

Arg Gin Phe Arg Ala Xaa 
1 5 



<210> 542 
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<211> 6 
<:212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Nle-therapeutic agent 
<400> 542 

Gin Ser Arg Ser Ser Xaa 
1 5 



<210> 543 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 543 

Arg Arg Gin Ser Arg Ser Ser Xaa 
1 5 



<210> 544 
<211> 7 
<:212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETyiiATION 
<222> 1 

<:221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 544 

Arg Arg Gin Ser Arg Ser Xaa 
1 S 



<210> 545 
<211> 7 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 545 

Arg Gly Ser Gly Arg Ser Xaa 
1 5 



<210> 546 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
«:220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Nle- therapeutic agent 
<400> 546 

Arg Gly Ser Gly Arg Ser Xaa 
1 5 



<210> 547 
<211> 7 
<:212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RKS 
<222> 7 

<223> Nle -therapeutic agent 
<400> 547 

Arg Gly Ser Gly Arg Ala Xaa 
1 S 



<210> 548 
<211> 8 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 548 

Arg Gly Ser Gly Arg Ser Ser Xaa 
1 5 



<210> 549 

<211> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 

<222> (0) . . . (0) 

<221> MOD_RES 

<:222> 8 

<223> Leucine -therapeutic agent 

<400> 549 

He Val Ser Gly Arg Ala Ser Xaa 
1 5 



<210> 550 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<:220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD RES 
<222> 9 " 

<223> I soleucine- therapeutic agent 
<400> 550 

Leu Arg Arg Gin Ser Arg Ser Ser Xaa 
1 5 



<210> 551 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 551 

Leu Arg Arg Gin Ser Axg Ser Xaa 
1 5 



<210> 552 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 552 

Gin Ser Arg Ala Xaa 
1 5 



<210> 553 
<211> 5 
<212> PRT 

<213> T^tificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 553 

Gin Ser Arg Ser Xaa 
1 5 



<210> 554 

<211> 5 

<212> PRT 

<213> Artificial Sequence 



<:220> 
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<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Glycine- therapeutic agent 
<400> 554 

Gin Ser Arg Ser Xaa 
1 5 



<210> 555 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 555 

Ttrg Ser Arg Ala Xaa 
1 5 



<210> 556 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Alanine -therapeutic agent 
<400> 556 

Arg Gin Ser Arg Ala Xaa 
1 5 



<210> 557 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 
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<221> ACETYIxATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Alalnine- therapeutic agent 
<400> 557 

Arg Gin Ser Arg Ser Xaa 
1 5 



<210> 558 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Alanine -therapeutic agent 
<400> 558 

Arg Gin Ser Arg Ser Ala Xaa 
1 5 



<210> 559 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Alanine -therapeutic agent 
<400> 559 

Arg Gly Ser Gly Arg Ser Xaa 
1 5 



<210> 560 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 



<221> ACETYIiATION 
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<222> 1 

<221> MOD_RES 

<222> 5 

<223> Alanine -therapeutic agent 

<400> 560 

Ser Gly Arg Ala Xaa 
1 5 



<210> 561 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Alanine -therapeutic agent 
<400> 561 

Ser Gly Arg Ser Xaa 
1 5 



<210> 562 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Alanine -therapeutic agent 
<400> 562 

Ser Gly Arg Ser Ser Xaa 
1 5 



<210> 563 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<;223> conjugate 

<;221> ACETYIiATION 
<:222> 1 
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<221> MOD_RES 
<222> g 

<223> Alanine- therapeutic agent 
<400> 563 

Ser Gly Arg Ala Ser Xaa 
1 5 



<210> 564 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<:223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Glycine -therapeutic agent 
<400> 564 

Ser Gly Arg Ser Xaa 
1 5 



<210> 565 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221:. MOD_RES 
<222> 6 

<223> Glycine -therapeutic agent 
<400> 565 

Ser Gly Arg Ser Ser Xaa 
1 5 



<;210> 566 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<:223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD RES 
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<222> 6 

<223> Alanine -therapeutic agent 
<400> 566 

Ser Gly Arg Ser Gly Xaa 
1 5 



<210> 567 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

< 2 2 3 > con j ugat e 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Glycine -therapeutic agent 
<400> 567 

Ser Gly Arg Ser Gly Xaa 
1 5 



<210> 


568 


<211> 


7 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


conjugate 


<221> 


ACETYLATION 


<222> 


i 


<221> 


MOD RES 


<222> 


7 


<223> 


Glycine -therapeutic 


<400> 


568 



Gly Thr Gly Arg Ser Gly Xaa 
1 5 



<210> 569 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 2 
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<223> Xaa is D- Serine 

<221> MOD_RES 
<222> 6 

<223> Alanine- therapeutic agent 
<400> 569 

Gly Xaa Ala Arg Ser Xaa 
1 5 



<210> 570 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<:223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 3 

<223> Xaa is D-Serine 

<221> MOD_RES 
<222> 7 

<223> Alanine -therapeutic agent 
<400> 570 

Arg Gly Xaa Ala Arg Ser Xaa 
1 5 



<210> 571 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Alanine- therapeutic agent 
<400> 571 

Gly Ser Gly Arg Ser Xaa 
1 5 



<210> 572 
<211> 7 
<212> PRT 

<213> Artificial Sequence 



<220> 
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<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Alanine -therapeutic agent 
<400> 572 

Arg Gly Ser Gly Arg Ser Xaa 
1 5 



<210> 573 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Alanine -therapeutic agent 
<400> 573 

Leu Arg Gly Ser Gly Arg Ser Xaa 
1 5 



<210> 574 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222'> 4 

<223> Xaa is D-Serine 

<221> MOD_RES 
<222> 8 

<223> Alanine -therapeutic agent 
<400> 574 

Leu Arg Gly Xaa Ala Arg Ser Xaa 
1 5 



<210> 575 
<211> 6 
<212> PRT 
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<:213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is S-Methylcysteine 

<221> MOD_RES 
<222> 6 

<221> Valine- therapeutic agent 
<400> 575 

Xaa Pro Gly Arg Val Xaa 
1 5 



<210> 576 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is S-Methylcysteine 

<221> MOD_RES 
<222> 6 

<223> Valine -therapeutic agent 
<400> 576 

Xaa Pro Gly Arg Val Xaa 
1 5 



<210> 577 

<211> 7 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 2 

<223> Xaa is S-Methylcysteine 
<221> MOD RES 
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<222> 7 

<223> Valine -therapeutic agent 
<400> 577 

Arg Xaa Pro Gly Arg Val Xaa 
1 5 



<210> 


578 


<211> 


8 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


conj ugate 


<221> 


ACETYLATION 


<222> 


1 


<221> 


MOD RES 


<222> 


3 


<223> 


Xaa is S -Methyl cysteine 


<221> 


MOD RES 


<222> 


8 


<223> 


Valine -therapeutic agent 


<400> 


578 



Arg Arg Xaa Pro Gly Arg Val Xaa 



<210> 579 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Alanine -therapeutic agent 
<400> 579 

Val Ser Ala Arg Met Xaa 
1 5 



<210> 580 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 
<221> ACETYLATION 
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<222> 1 

<221> MOD__RES 
<222> 7 

<223> Alanine -therapeutic agent 
<400> 580 

He Val Ser Ala Arg Met Xaa 
1 5 



<210> 581 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Alanine -therapeutic agent 
<400> 581 

Val He Val Ser Ala Arg Met Xaa 
1 5 



<210> 582 
<211> 8 
<212> PRT 

<:213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Xaa is Nle 

<221> MOD_RES 
<222> 8 

<223> Alanine -therapeutic agent 
<400> 582 

Val He Val Ser Ala Arg Xaa Xaa 
1 5 



<210> 583 
<211> 6 
<212> PRT 

<213> Artificial Sequence 



<220> 
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<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Xaa is Nle 

<221> MOD_RES 
<222> 6 

<223> Alanine -therapeutic agent 
<400> 583 

Val Ser Ala Arg Xaa Xaa 
1 B 



<210> 584 

<211> 7 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Xaa is Nle 

<221> MOD_RES 
<222> 7 

<:223> Alanine -therapeutic agent 
<400> 584 

lie Val Ser Ala Arg Xaa Xaa 
1 5 



<210> 585 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 585 

Gly Ser Gly Arg Ser Xaa 
1 5 
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<210> 586 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 586 

Gly Ser Gly Arg Ser Ser Xaa 
1 5 



<210> 587 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<22 3 > conj ugate 

<:221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 6 

<223> Leucine- therapeutic agent 
<400> 587 

Gly Ser Ala Arg Ser Xaa 
1 5 



<210> 588 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<:222> 5 

<223> Leucine-therapeutic agent 
<400> 588 

Ser Gly Arg Ser Xaa 
1 5 



<210> 589 
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<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 

<222> 1 

<221> MOD_RES 

<222> 6 

<223> Leucine -therapeutic agent 

<400> 589 

Ser Gly Arg Ser Ser Xaa 
1 5 



<210> 590 

<211> 5 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 

<222> 1 

<221> MOD_RES 

<222> 5 

<223> Leucine- therapeutic agent 

<400> 590 

Ser Ala Arg Ser Xaa 
1 5 



<210> 591 
<211> 7 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 



<221> MOD_RES 
<222> 7 

<223> Leucine- therapeutic agent 
<400> 591 

Arg Gly Ser Gly Arg Ser Xaa 
1 5 



<2ia> 592 
<211> 8 
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<212> PRT 

<213> Artificial Sequence 
<:220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 592 

Arg Gly Ser Gly Arg Ser Ser Xaa 
1 5 



<210> 593 
<211> 7, 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 7 

<223> Leucine -therapeutic agent 
<400> 593 

Arg Gly Ser Ala Arg Ser Xaa 
1 5 



<210> 594 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 594 

Leu Arg Gly Ser Gly Arg Ser Xaa 
1 5 



<210> 595 
<211> 9 
<212> PRT 
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<213> Artificial Sec[uence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 9 

<223> Leucine -therapeutic agent 
<400> 595 

Leu Arg Gly Ser Gly Arg Ser Ser Xaa 
1 5 



<210> 596 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 8 

<223> Leucine -therapeutic agent 
<400> 596 

Leu Arg Gly Ser Ala Arg Ser Xaa 
1 5 



<210> 597 

<211> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 

<222> 1 

<221> MOD_RES 

<222> 8 

<223> Alanine -therapeutic agent 

<400> 597 

Leu Arg Arg Gin Ser Arg Ala Xaa 
1 5 



<210> 598 
<211> 5 
<212> PRT 

<213> Artificial Sequence 



J 
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<220> 

<223> conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is N-Methylsulf onyl- alpha -cyclohexyl-D- Alanine 

<221> MOD_RES 
<222> 2 

<223> Xaa is Abu 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 598 

Xaa Xaa Arg Ser Xaa 
1 5 



<210> 599 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYIiATION 
<222> 1 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<:400> 599 

Arg Ala Arg Ser Xaa 
1 5 



<210> 600 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa si Alpha- Cydohexyl-D- alanine 

<221> MOD_RES 
<222> 2 
<223> Abu 

<221> MOD_RES 
<222> 5 



i 
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<223> Leucine -therapeutic agent 
<400> 600 

Xaa Xaa Arg Ser Xaa 
1 5 



<210> 601 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is Alpha- Cyclohexyl-D- Alanine 

<221> MOD_RES 
<222> 2 
<223> Abu 

<221> MOD_RES 
<222> 6 

<223> Leucine -therapeutic agent 
<400> 601 

Xaa Xaa Arg Ser Ser Xaa 
1 5 



<210> 602 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 

<222> 1 

<221> MOD_RES 

<222> 6 

<223> Leucine- therapeutic agent 

<400> 602 

Gin Gly Arg Ser Ser Xaa 
1 5 



<210> 603 

<211> 6 

<212> PRT 

<213> Artificial Sequence 



<220> 



• 




wo 02/095007 PCTAJS02/16819 



-283 



<223> conjugate 

<221> MOD_RES 

<222> 1 

<223> Xaa is Methoxycarbonyl-D-homophenyl alanine 

<221> MOD_RES 

<222> 2 

<223> Xaa is 4Hyp 

<221> MOD_RES 

<222> 6 

<223> Leucine -therapeutic agent 

<400> 603 

Xaa Xaa Arg Ser Ser Xaa 
1 5 



<210> 604 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 
<221> MOD_RES 

*=222> 1 ^ 
<223> Xaa is Methoxycarbonyl- (alpha) -3-metnylbenzyl 
glutamic acid -delta -methyl ester 

<221> MOD_RES 
<222> 5 

<223> Leucine -therapeutic agent 
<400> 604 

Xaa Gly Arg Ser Xaa 
1 5 



<210> 605 
<211> 6 
<212> PRT 

<:213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is D-cyclohexylalanine 

<221> MOD_RES 
<222> 2 

<223> Xaa is 4Hyp 



<221> MOD RES 
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<222> 5 



<223> Leucine -therapeutic agent 



<400> 605 

Xaa Xaa Arg Ser Ser Xaa 
1 5 



<210> 606 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> ACETYLATION 
<222> 1 

<221> MOD_RES 
<222> 1 

<223> Xaa is D-Clohexylalanine 

<221> MOD_RES 
<222> 2 

<223> Xaa is Abu 

<221> MOD_RES 
<222> 6 

<223> Leucine- therapeutic agent 
<400> 606 

Xaa Xaa Arg Ser Ser Xaa 
1 5 



<:210> 607 

<211> 5 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is Methoxycarbonyl- (alpha- (3-cyanobenzyl) ) 
glutamic acid -delta -methyl ester 

<221> AMIDATION 
<222> 5 

<400> 607 

Xaa Gly Arg Ser Leu 
1 5 



<210> 608 

<211> 5 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> conjugate 

<221> KOD_RES 
<:222> 1 

<223> Xaa is Methoxycarbonyl- {alpha- (3-amidinobenzyl) ) 
glutamic acid -delta -methyl ester 

<221> AMIDATION 
<222> 5 

<400> 608 

Xaa Gly Arg Ser Leu 
1 5 



<210> 609 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conjugate 

<221> MOD_RES 
<222> 1 

<223> Xaa is Methoxycarbonyl- (alpha- {3-amidinobenzyl) ) 
glutamic acid 

<221> AMIDATION 
<222> 5 

<400> 609 

Xaa Gly Arg Ser Leu 
1 5 



<210> 610 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Conjugate 

<221> ACETYLATION 
<222> (0) . . . (0) 

<221> MOD_RES 
<:222> 6 

<223> I soleucine- therapeutic agent 
<400> 610 

Arg Arg Gin Ser Arg Xaa 
1 5 



<210> 611 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Conjugate 

<221> ACETYIATION 
<222> 1 

<221> MOD_RSS 
<222> 8 

<223> Isoleucine -therapeutic agent 



<400> eii 

Leu Arg Arg Gin Ser Arg Ala Xaa 
1 5 



